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BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] This invention relates to the field of biology. In a particular embodiment, it relates 

to peptides, polynucleotides, and compositions useful to monitor or elicit an immune 
response to selected antigens, and methods of identifying such , peptides and 
polynucleotides. 

Related Art 

[0002] HLA class I molecules are expressed on the surface of almost all nucleated cells. 

Following intracellular processing of antigens, epitopes from the antigens are presented as 
a complex with the HLA class I molecules on the surface of such cells. CTL recognize the 
peptide-HLA class I complex, which then results in the destruction of the cell bearing the 
HLA-peptide complex directly by the CTL and/or via the activation of non-destructive 
mechanisms e.g., the production of interferon, that inhibit viral replication. 

[0003] Human Immunodeficiency Virus. Acquired immunodeficiency syndrome (AIDS) 

caused by infection with human immunodeficiency virus- 1 (HTV-1) represents a major world 
health problem. Estimates indicate that about 16,000 people worldwide are infected with HIV 
each day. 

[0004] The development of anti-viral drugs has been a major advancement in reducing viral loads 

in HIV infected patients. Highly active retroviral therapy (HAART) has been shown to reduce 
viremia to nearly undetectable levels. However, current drug therapies are not practicable as a 
long term solution to the HIV epidemic. HAART therapy is severely limited due to poor tolerance 
for the drugs and the emergence of drug-resistant virus. Moreover, replication competent HIV 
persists in the lymphoid tissue of patients who have responded to HAART, thus serving as a 
reservoir of virus. Lastly, current anti-retroviral drug therapies have little impact upon the global 
epidemic: almost 90% of the world's HIV infected population resides within countries lacking 
financial resources for these drugs. Thus, a need exists for an efficacious vaccine to both prevent 
and treat HIV infection. 
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[0005] Virus-specific, human leukocyte antigen (HLA) class I-restricted cytotoxic T lymphocytes 

(CTL) are known to play a major role in the prevention and clearance of virus infections in vivo 
(Oldstone et al., Nature 321:239, 1989; Jamieson et al., J. Virol. 61:3930, 1987; Yap et al, Nature 
273:238, 1978; Lukacher et al., J. Exp. Med. 160:814, 1994; McMichael et al., N. Engl. J. Med 
309:13, 1983; Sethi et al., J. Gen. Virol. 64:443, 1983; Watari et al., J. Exp. Med. 165:459, 1987- 
Yasukawa et al., J. Immunol. 143:2051, 1989; Tigges et al., J. Virol. 66:1622, 1993; Reddenhase 
et al., J. Virol. 55:263, 1985; Quinnan et al., N. Engl. J. Med. 307:6, 1982). 
[0006] While immune correlates of protective immunity against HIV infection are not well 

defined, there is a growing body of evidence that suggests CTL are important in controlling HIV 
infection. HIV-specific CTL responses can be detected early in infection and the appearance of 
the responses corresponds to the time in infection at which initial viremia is reduced (Pantaleo et 
al, Nature 370:463, 1994; Walker et al., Proc. Natl. Acad. Sci. 86:9514, 1989). In addition, HIV 
replication in infected lymphocytes can be inhibited by incubation with autologous CTL (see, e g 
Tsubota et al., J. Exp. Med. 169:1421, 1989). These data are supported by recent studies thai 
indicate CTL are required for controlling viral replication in a SIV/rhesus animal model (Schmitz 
et al, Science 283:857, 1999), and additionally supported by studies that demonstrate that CTL 
exert selective pressure on HTV populations as evidenced by the eventual predominance of viruses 
with amino acid replacements in those regions of the virus to which CTL responses are directed 
(see, e.g., Borrow et al, Nature Med. 3:205-211, 1997; Price et al, Proc. Nat. Acad. Sci. 
94:12890-1895, 1997; Koenig et al, Nature Med. 1:330-336, 1995; and Haas et al, J. Immunol 
157:4212-4221, 1996). 

[0007] ■ Virus-specific T helper lymphocytes are also known to be critical for mamtaining effective 

immunity in chronic viral infections. Historically, HTL responses were viewed as primarily 
supporting the expansion of specific CTL and B cell populations; however, more recent data 
indicate that HTL may directly contribute to the control of virus replication. For example, a 
decline in CD4 + T cells and a corresponding loss in HTL function characterize infection with HIV 
(Lane etal, New EnglJ. Med. 313:79, 1985). Furthermore, studies in HTV infected patients have 
also shown that there is an inverse relationship between virus-specific HTL responses and viral 
load, suggesting that HTL play a role in viremia (see, e.g., Rosenberg et al, Science 278 1447 
1997). 

[0008] A fundamental challenge in the development of an efficacious HIV vaccine is the 

heterogeneity observed in HIV. The virus, like some other infectious agents including 
retroviruses, rapidly mutates during replication resulting in the generation of virus that can escape 
anti-viral therapy and immune recognition (Borrow et al., Nature Med. 3:205, 1997). In addition, 
HTV can be classified into a variety of subtypes that exhibit significant sequence divergence (see 
eg., Lukashov et al, AIDS 12:S43, 1998). In view of the heterogeneous nature of HIV, and the 
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heterogeneous immune response observed with HIV infection, induction of a multi-specific 
cellular immune response directed simultaneously against multiple HIV epitopes appears to be 
important for the development of an efficacious vaccine against HIV. There is a need to establish 
such vaccine embodiments which elicit immune responses of sufficient breadth and vigor to 
prevent and/or clear HIV infection. 

[0009] Hepatitis B Virus. Chronic infection by hepatitis B virus (HBV) affects at least 

5% of the world's population and is a major cause of cirrhosis and hepatocellular 
carcinoma (Hoofiiagle, J., N. Engl. J. Med. 323:337, 1990; Fields, B. and Knipe, D., In: 
Fields Virology 2:2137, 1990). The World Health Organization lists hepatitis B as a 
leading.,cause of death worldwide, close behind chronic pulmonary disease, and more 
prevalent than AIDS. Chronic HBV infection can range from an asymptomatic carrier 
state to continuous hepatocellular necrosis and inflammation, and can lead to 
hepatocellular carcinoma. 

[0010] The immune response to HBV is believed to play an important role in controlling 

hepatitis B infection. A variety of humoral and cellular responses to different regions of 
the HBV nucleocapsid core and surface antigens have been identified. T cell mediated 
immunity, particularly involving class I human leukocyte antigen-restricted cytotoxic T 
lymphocytes (CTL), is believed to be crucial in combatting established HBV infection. 

[0011] Several studies have emphasized the association between self-limiting acute 

hepatitis and multispecific CTL responses (Penna, A. et aL, J. Exp. Med. 174:1565, 1991; 
Nayersina, R. et al., J. Immunol. 150:4659, 1993). Spontaneous and interferon-related 
clearance of chronic HBV infection is also associated with the resurgence of a vigorous 
CTL response (Guidotti, L. G. et aL, Proc. Natl. Acad. Sci. USA 91:3764, 1994). In all 
such cases the CTL responses are polyclonal, and specific for multiple viral proteins 
including the HBV envelope, core and polymerase antigens. By contrast, in patients with 
chronic hepatitis, the CTL activity is usually absent or weak, and antigenically restricted. 

[0012] The crucial role of CTL in resolution of HBV infection has been further 

underscored by studies using HBV transgenic mice. Adoptive transfer of HBV-specific 
CTL into mice transgenic for the HBV genome resulted in suppression of virus 
replication. This effect was primarily mediated by a non-lytic, lymphokine-based 
mechanism (Guidotti, L. G. et aL, Proc. Natl. Acad. Sci. USA 91:3764, 1994; Guidotti, L. 
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G., Guilhot, S., and Chisari, F. V. J. Virol. 68:1265, 1994; Guidotti, L. G. et al., /. Virol 
69:6158, 1995; Gilles, P. N., Fey, G., and Chisari, F. V., J. Virol. 66:3955, 1992). 
[0013] As is the case for HLA class I restricted responses, HLA class H restricted T cell 

responses are usually detected in patients with acute hepatitis, and are absent or weak in 
patients with chronic infection (Chisari, F. V. and Ferrari, C, Annu. Rev. Immunol. 13:29, 
1995). HLA Class H responses are tied to activation of helper T cells (HTLs) Helper T 
lymphocytes, which recognize Class H HLA molecules, may directly contribute to the 
clearance of HBV infection through the secretion of cytokines which suppress viral 
replication (Franco, A. et al., J. Immunol. 159:2001,1997). However, their primary role in 
disease resolution is believed to be mediated by inducing activation and expansion of 
virus-specific CTL and B cells. 
[0014] In view of the heterogeneous immune response observed with HBV infection, 

induction of a multi-specific cellular immune response directed simultaneously against 
multiple epitopes appears to be important for the development of an efficacious vaccine 
against HBV. There is a need to establish vaccine embodiments that elicit immune 
responses that correspond to responses seen in patients that clear HBV infection. 
Epitope-based vaccines appear useful. 

[0015] Hepatitis C Virus. Hepatitis C virus (HCV) infection is a global human health 

problem with approximately 150,000 new reported cases each year in the U.S. alone. 
HCV is a single stranded RNA virus, and is the etiological agent identified in most cases 
of non-A, non-B post-transfusion and post-transplant hepatitis, and is a common cause of 
acute sporadic hepatitis (Choo et al, Science 244:359, 1989; Kuo et al, Science 244:362, 
1989; and Alter et al, in: Current Perspective in Hepatology, p. 83, 1989). It is estimated 
that more than 50% of patients infected with HCV become chronically infected and, of 
those, 20% develop cirrhosis of the liver within 20 years (Davis et al, New Engl J. Med. 
321:1501, 1989; Alter et al, in: Current Perspective in Hepatology, p. 83, 1989; Alter et 
al, New Engl J. Med. 327:1899, 1992; and Dienstag, J. L. Gastroenterologl 85:430, 
1983). Moreover, the only therapy available for treatment of HCV infection is interferon- 
oc. Most patients are unresponsive, however, and among the responders, there is a high 
recurrence rate within 6-12 months of cessation of treatment (Liang et al, J. Med. Virol. 
40:69, 1993). Ribaviron, a guanosine analog with a broad spectrum activity against many 
RNA and DNA viruses, has been shown in clinical trials to be effective against chronic 
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HCV infection when used in combination with interferon- a (see, e.g., Poynard et al, 
Lancet 352:1426-1432, 1998; Reichard et al, Lancet 351:83-87, 1998) However, the 
response rate is still well below 50%. 

[0016] Virus-specific, human leukocyte antigen (HLA) class I-restricted cytotoxic T 

lymphocytes (CTL) are known to play a major role in the prevention and clearance of 
virus infections in vivo (Oldstone et al, Nature 321:239, 1989; Jamieson et al., J. Virol. 
61:3930, 1987; Yap et al, Nature 273:238, 1978; Lukacher et al, J. Exp. Med. 160:814, 
1994; McMichael et al, N. Engl. J. Med. 309:13, 1983; Sethi et al, J. Gen. Virol. 64:443, 
1983; Watari et al, J. Exp. Med. 165:459, 1987; Yasukawa et al, J. Immunol 143:2051, 
1989; Tigges et al, J. Virol. 66:1622, 1993; Reddenhase et al, J. Virol 55:263, 1985; 
Quinnan et al, N. Engl J. Med. 307:6, 1982). 

[0017] In view of the heterogeneous immune response observed with HCV infection, 

induction of a multi-specific cellular immune response directed simultaneously against 
multiple HCV epitopes appears to be important for the development of an efficacious 
vaccine against HCV. There is a need, however, to establish vaccine embodiments that 
elicit immune responses that correspond to responses seen in patients that clear HCV 
infection. 



[0018] Human Papillomavirus. Human papillomavirus (HPV) is a member of the 

papillomaviridae, a group of small DNA viruses that infect a variety of higher vertebrates. 
More than 80 types of HPVs have been identified. Of these, more than 30 can infect the 
genital, tract. Some types, generally types 6 and 11, may cause genital warts, which are 
typically benign and rarely develop into cancer. Other strains of HPV, "cancer- 
associated", or "high-risk" types, can more frequently lead to the development of cancer. 
The primary mode of transmission of these strains of HPV is through sexual contact. 

[0019] The main manifestations of the genital warts are cauhflower-like condylomata 

acuminata that usually involve moist surfaces; keratotic and smooth papular warts, usually 
on dry surfaces; and subclinical "flat" warts, which are found on any mucosal or cutaneous 
surface (Handsfield, H, Am. J. Med. 102(5A): 16-20, 1997). These warts are typically 
benign but are a source of inter-individual spread of the virus (Ponten, J. & Guo, Z., 
Cancer Surv. 32:201-29, 1998). At least three HPV strains associated with genital warts 
have been identified: type 6a (see, e.g., Hofrnann, K.J., et al, Virology 209(2): 506-51%, 
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1995), type 6b (see, e.g., Hofmann et al, supra) and type 1 1 (see, e.g., Dartmann, K. et al., 
Virology 151(1):124-130, 1986). 
[0020] Cancer-associated HPVs have been linked with cancer in both men and women; 

they include, but are not limited to, HPV-16, HPV-18, HPV-31, HPV-45, HPV-33 and 
HPV-56. Other HPV strains, including types 6 and 1 1 as well as others, e.g., HPV-5 and 
HPV-8, are less frequently associated with cancer. The high risk types are typically 
associated with the development of cervical carcinoma and premalignant lesions of the 
cervix in women, but are also associated with similar malignant and premalignant lesions 
at other anatomic sites within the lower genital or anogenital tract. These lesions include 
neoplasia of the vagina, vulva, perineum, the penis, and the anus. HPV infection has also 
been associated with respiratory tract papillomas, and rarely, cancer, as well as abnormal 
growth or neoplasia in other epithelial tissues. See, e.g. VIROLOGY, 2 nd Ed, Fields et al, 
Eds. Raven Press, New York, 1990, Chapters 58 and 59, for a review of HPV association 
with cancer. 

[0021] The HPV genome consists of three functional regions, the early region, the late 

region, and the "long control region". The early region gene products control viral 
replication, transcription and cellular transformation They include the HPV El and E2 
proteins, which play a role in HPV DNA replication, and the E6 and E7 oncoproteins, 
which are involved in the control of cellular proliferation. The late region include the 
genes that encode the structural proteins LI and L2, which are the major and minor capsid 
proteins, respectively. The "long control region" contains such sequences as enhancer and 
promoter regulatory regions. 
[0022] HPV expresses different proteins at different stages of the infection, for example 

early, as well as late, proteins. Even in latent infections, however, early proteins are often 
expressed and are therefore useful targets for vaccine-based therapies. For example, high- 
grade dysplasia and cervical squamous cell carcinoma continue to express E6 and E7, 
which therefore can be targeted to treat disease at both early and late stages of infection. 
[0023] Treatment for HPV infection is often unsatisfactory because of persistence of virus 

after treatment and recurrence of clinically apparent disease is common. The treatment 
may require frequent visits to clinics and is not directed at elimination of the virus but at 
clearing warts. Because of persistence of virus after treatment, recurrence of clinically 
apparent disease is common. 
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[0024] Thus, a need exists for an efficacious vaccine to both prevent and treat HPV 

infection and to treat cancer that is associated with HPV infection. Effective HPV 
vaccines would be a significant advance in the control of sexually transmissable infections 
and could also protect against clinical disease, particularly cancers such as cervical cancer. 
(see, e.g., Rowen, P. & Lacey, C, Dermatologic Clinics 16(4):835-838, 1998). 
[0025] Virus-specific, human leukocyte antigen (HLA) class I-restricted cytotoxic T 

lymphocytes (CTL) are known to play a major role in the prevention and clearance of 
virus infections in vivo (Oldstone et al, Nature 321:239, 1989; Jamieson et al., J. Virol. 
61:3930, 1987; Yap et al, Nature 273:238, 1978; Lukacher et ah, J. Exp. Med. 160:814, 
1994; McMichael et al, N. Engl. J. Med. 309:13, 1983; Sethi et al., J. Gen. Virol. 64:443^ 
1983; Watari et al, J. Exp. Med. 165:459, 1987; Yasukawa et al, J. Immunol. 143:2051,' 
1989; Tigges et al, J. Virol. 66:1622, 1993; Reddenhase et al, J. Virol 55:263, 1985; 
Quinnan et al, N. Engl J. Med. 307:6, 1982). 
[0026] Virus-specific T helper lymphocytes are also known to be critical for maintaining 

effective immunity in chronic viral infections. Historically, HTL responses were viewed 
as primarily supporting the expansion of specific CTL and B cell populations; however, 
more recent data indicate that HTL may directly contribute to the control of virus 
replication. For example, a decline in CD4 + T cells and a corresponding loss in HTL 
function characterize infection with HTV (Lane et al, New Engl J. Med. 313:79, 1985). 
Furthermore, studies in HTV infected patients have also shown that there is an inverse 
relationship between virus-specific HTL responses and viral load, suggesting that HTL 
plays a role in viremia (see, e.g., Rosenberg et al, Science 278:1447, 1997). 
[0027] The development of vaccines with prophylactic and therapeutic efficacy against 

HPV is ongoing. Early vaccine development was hampered by the inability to culture 
HPV. With the introduction of cloning techniques and protein expression, however, some 
attempts have been made to stimulate humoral and CTL response to HPV (See, e.g., 
Rowen, P. & Lacey, C, Dermatologic Clinics 16(4):835-838 (1998)). Studies to date, 
however, have been inconclusive. 
[0028] Activation of T helper cells and cytotoxic lymphocytes (CTLs) in the development 

of vaccines has also been analyzed. Lehtinen, M., et al. for instance, has shown that some 
peptides from the E2 protein of HPV type 16 activate T helper cells and CTLs (Biochem. 
Biophys. Res. Commun. 209(2):541-6 (1995). Similarly, Tarpey et al, has shown that 
some peptides from HPV type 1 1 E7 protein can stimulate human HPV-specific CTLs in 
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vitro (Immunology 81:222-227 (1994)) and Borysiewicz et al. have reported a 
recombinant vaccinia virus expressing HPV 16 and HPV 17 E6 and E7 that stimulated 
CTL responses in at least one patient (Lancet 347:1347-1357, 1996). 

[0029] Plasmodium falciparum and Malaria. Malaria, which is caused by infection with 

the parasite Plasmodium falciparum (PF), represents a major world health problem. 
Approximately 500 million people in the world are at risk from the disease, with 
approximately 200 million people actually harboring the parasites. An estimated 1 to 2 
million deaths occur each year due to malaria. (Miller et al, Science 234:1349, 1986). 
[0030] Fatal outcomes are not confined to first infections, and constant exposure is 

apparently a prerequisite for mamtaining immunity. Naturally acquired sterile immunity 
is rare, if it exists at all. Accordingly, major efforts to develop an efficacious malaria 
vaccine have been undertaken. 
[0031] Human volunteers injected with irradiated PF sporozoites are resistant to 

subsequent sporozoite challenges, which demonstrates that development of a malaria 
vaccine is indeed immunologically feasible. Furthermore, these immune individuals 
developed a vigorous response, including antibodies, and cytotoxic T lymphocyte (CTL) 
and helper T lymphocyte (HTL) components, directed against multiple antigens. 
Reproducing the breadth and multiplicity of this response in a vaccine, however, is a task 
of large proportions. The epitope approach, as described herein, may represent a solution 
to this challenge, in that it allows the incorporation of various antibody, CTL and HTL 
epitopes, from various proteins, in a single vaccine composition. 
[0032] Anti-sporozoite antibodies are by themselves, in general, not completely 

efficacious in clearing the infection (Egan et al, Science 236:453, 1987). However, high 
concentrations of antibodies directed against the repeated region of the major B cell 
antigen of the sporozoite/circumsporozoite protein (CSP) have been shown to prevent liver 
cell infection in certain experimental models (Egan et al., Science 236:453, 1987; 
Potocnjak, P. et al, Science 207:71, 1980). The present inventors have shown that 
constructs encompassing CSP-repeat B cell epitopes and the optimized helper epitope 
PADRE™ (San Diego, CA) are highly immunogenic, and can protect in vitro against 
sporozoite invasion in both mouse and human liver cells, and protect mice in vivo against 
live sporozoite challenge (Franke et al, Vaccine 17:1201-1205, 1999) 
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[0033] PF-specific CD4 + T cells also have a role in malarial immunity beyond providing 

help for B cell and CTL responses. Experiments by Renia et al (Renia, et al, Proc. Natl. 
Acad. Sci. USA 88:7963, 1991) demonstrated that HTLs directed against the Plasmodium 
yoelli CS protein could in fact adoptivley transfer protection against malaria. 
[0034] Considerable data implicate CTLs in protection against pre-erythrocytic-stage 

malaria. CD8 + CTLs can eliminate Plasmodium berghei- or Plasmodium yoe/n-infected 
mouse hepatocytes from in vitro culture in a major histocompatibility complex (MHC)- 
restricted and antigen-restricted manner (Hoffman et al., Science 244:1078-1081, 1989; 
Weiss et al, J. Exp. Med. 171:763-773, 1990). Further, it has also been shown that the 
immunity that developed in mice vaccinated with irradiated sporozoites is also dependent 
upon the present of CD8+ T cells. These T cells accumulate in inflammatory liver 
infiltrates subsequent to challenge. Passive transfer of circumsporozoite (CSP)-specific 
CTL clones as long as three hours after inoculation of sporozoites (i.e., after the parasites 
have left the bloodstream and infected liver cells) were capable of protecting animals 
against infection (Romero et al. , Nature 341 :323, 1989). 
[0035] It is notable that CTL-restricted responses directed against a single antigen are 

insufficient to protect mice with different MHC alleles, and a combination of multiple 
antigens was required even to protect mice from the most common laboratory strains of 
Plasmodium. These data indicate that a combination of epitopes form several antigens is 
necessary to elicit a protective CTL response. 
[0036] Indirect evidence that CTLs are important in protective immunity against Pf in 

humans has also accumulated. It has been reported that cytotoxic CD8 + T cells can be 
identified in humans immunized with PF sporozoites (Moreno, et al., Int. Immunol. 3:997, 
1991). Further, humans immunized with irradiated sporozoites or naturally exposed to 
malaria can generate a CTL response to the pre-erythrocytic-stage antigens, CSP, 
sporozoite surfece protein 2 (SSP2), liver-stage antigen-1 (LSA-1), and exported protein-1 
(Exp-1) (see, e.g. Malik et al, Proc. Natl. Acad. Sci. USA 88, 3300-3304, 1991; Doolan et 
al, Int. Immunol. 3:511-516, 1991; Hill et al, Nature 360:434-439, 1992). Additionally, 
there is evidence that the polymorphism within the CSP may be the result of selection by 
CTLs of parasites that express variant forms (MCutchan and Water, Immunol. Lett. 25:23- 
26, 1990). This is based on the observation that the variation is nonsynonymous at the 
nucleotide level, thereby indicating selective pressure at the protein level. The 
polymorphism primarily maps to identified CTL and T helper epitopes (Doolan et al, Int. 
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Immunol. 5:27-46, 1993); and CTL responses to some of the parasite variants do not cross- 
react (Hill et al, supra). Finally, the MHC class I human leukocyte antigen (HLA)-Bw53 
has been associated with resistance to severe malaria in The Gambia, and CTLs to a 
conserved epitope restricted by the HLA-Bw53 allele have been identified on P. 
falciparum LSA-1 (Hill et ah, Nature 352:595-600, 1991; Hill et al., Nature 340:434-439, 
1992). Since HLA-Bw53 is found in 15%-40% of the population of sub-Saharan Africa 
but in less than 1% of Caucasians and Asians, these data suggest evolutionary selection on 
the basis of protection against severe malaria. 

[0037] Thus, antibody, and both HLA class I and class H restricted responses directed 

against multiple sporozoite antigens appear to be involved in generating protective 
immunity to malaria. Furthermore, several important antigenic epitopes against which 
humoral and cellular immunity is focused have already been exactly delineated. 

[0038] In view of the heterogeneous immune response observed with PF infection, 

induction of a multi-specific cellular immune response directed simultaneously against 
multiple PF epitopes appears to be important for the development of an efficacious 
vaccine against PF. There is a need, however, to establish vaccine embodiments that elicit 
immune responses that correspond to responses seen in patients that clear PF infection. 

[0039] Epitope-Based Vaccines. The use of epitope-based vaccines has several 

advantages over current vaccines. The epitopes for inclusion in such a vaccine are to be 
selected from conserved regions of viral or tumor-associated antigens, in order to reduce 
the likelihood of escape mutants. The advantage of an epitope-based approach over the 
use of whole antigens is that there is evidence that the immune response to whole antigens 
is directed largely toward variable regions of the antigen, allowing for immune escape due 
to mutations. Furthermore, immunosuppressive epitopes that may be present in whole 
antigens can be avoided with the use of epitope-based vaccines. 

[0040] Additionally, with an epitope-based vaccine approach, there is an ability to 

combine selected epitopes (CTL and HTL) and additionally to modify the composition of 
the epitopes, achieving, for example, enhanced immunogenicity. Accordingly, the 
immune response can be modulated, as appropriate, for the target disease. Similar 
engineering of the response is not possible with traditional approaches. 
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[0041J Another major benefit of epitope-based immune-stimulating vaccines is then- 

safety. The possible pathological side effects caused by infectious agents or whole protein 
antigens, which might have their own intrinsic biological activity, is ehminated. 

[0042] An epitope-based vaccine also provides the ability to direct and focus an immune 

response to multiple selected antigens from the same pathogen. Thus, patient-by-patient 
variability in the immune response to a particular pathogen may be alleviated by inclusion 
of epitopes from multiple antigens from that pathogen in a vaccine composition. A 
"pathogen" may be an infectious agent or a tumor associated molecule. 

[0043] One of the most formidable obstacles to the development of broadly efficacious 

epitope-based immunotherapeutics has been the extreme polymorphism of HLA 
molecules. In the past, effective non-genetically biased coverage of a population has been 
a task of considerable complexity; such coverage has required that epitopes be used 
specific for HLA molecules corresponding to each individual HLA allele. Therefore, 
unpractically large numbers of epitopes would been required in order to cover ethnically 
diverse populations. Recently, methods have been developed that allow the identification 
of epitopes that bind multiple HLA molecules. Therefore, epitope-based vaccines can be 
designed that contain epitopes which, either individually or in combination, bind a greater 
number of HLA molecules. The resulting epitope-based vaccines have a greater breadth 
of population coverage across one or more continents and even worldwide. 



[0044] Variation in Epitopes of Infectious Agents. A challenge in the development of 

effective vaccines against infectious agents such as hepatitis B virus (HBV) (47, 60) 
hepatitis C virus (HCV) (61-63), human papilloma virus (HPV) (64, 65) Plasmodium 
falciparum (66), and human immunodeficiency virus (HTV-1) is the protein sequence 
variation associated with different isolates. This variation is the result of gene sequence 
mutations. When such mutations occur in regions encoding epitopes recognized by 
cytotoxic T-lymphocytes (CTL), they provide a mechanism for escape of the agent from 
immune system control. 

[0045] HTV-1 represents an infectious agent with an especially high frequency of 

sequence variation. The sequence variation associated with HTV-1 proteins from related 
isolates, members of the same clades or types, as well as unrelated isolates, is well 
documented (1). Viral escape from CTL induced as the result of natural infection or 
vaccines was documented in nonhuman primate models where the mechanism behind this 
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escape was mutation of the primary anchor residues in dominant CTL epitopes (5-9). 
Viral escape from HTV-specific CTL has also been strongly implied by data obtained from 
HTV-1 infected individuals whose disease status change, including the transition from 
acute to chronic infection (10, 11), loss of stable control of viral replication and 
subsequent progression to AIDS (4, 12) or mother-to-child transmission (13). Thus, HTV- 
1 genetic and protein sequence variation represent a significant challenge to immune 
system-based control of viral replication, both within infected individuals and within 
populations. 

[0046] While the public health need for a vaccine against HTV-1 is well recognized and 

accepted, the genetic variation of HTV-1 isolates represents a highly significant obstacle 
(1, 14-16). Several strategies have been proposed, some of which include: 

(1) Designing vaccines on HTV-1 types prevalent within small, well defined 
populations or geographical regions, such as individual countries or regions, and 
producing multiple different vaccines for exclusive use within these countries or 
regions (16). 

(2) Use of HTV-1 ancestral or consensus sequences based on HTV types present in 
larger target populations, such as groups of neighboring countries or continents 
(15,17-19). 

(3) Incorporation of viral gene products obtained from multiple different virus 
isolates, representing diversely different types or clades, into a single 'multi- 
valent' vaccine. 



[0047] Related vaccine design concepts that incorporate many of the advantages 

associated with the approaches described above are the use of highly conserved regions or 
epitopes derived from these regions as the basis of the vaccine. The logic behind this 
approach is that conserved regions of the viral genome are those that have been 
maintained through the evolution of HTV-1 because changes impact gene product function 
and general viral fitness. This theory is consistent with analyses of HTV-1 protein 
sequence data which demonstrated that CTL epitopes are concentrated in conserved 
regions and that regions devoid of CTL epitopes are me most variable (20). Additional 
support comes from published reports describing CTL responses, induced as the result of 
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natural infection or vaccination, that recognize viral proteins or epitopes common to viral 
isolates from diverse types or clades (21-26). Broad function CTL responses are also 
known to be correlated with slower progression to AIDS, at least for certain carefully 
studied populations (27, 28). Despite these reports and the clustering of CTL epitopes in 
conserved regions of HIV-1 gene products, amino acid sequence variation of analogous 
regions and epitopes from different viral isolates, both within the same type or clade and 
from different types, remains significant. There are currently no rules guiding the 
selection of conserved regions of CTL epitopes for use in vaccines other than the use of 
amino acid sequence identity (29). 
[0048] A clear understanding of how CTL recognize pathogen infected cells has emerged 

over the past decade. It is now well established that small fragments of pathogen-derived 
proteins are generated, defined as peptide epitopes generally 8-11 amino acids in length, 
which bind to HLA-A, -B, or -C (human Class I Major Histocompatability Molecules) 
molecules expressed on the cell surface. Sequencing of naturally processed peptides 
bound to HLA molecules provided a means to identify the amino acid residues required 
for allele-specific epitope-peptide binding (30-32). Data obtained from X-ray 
crystallographic analysis of HLA-epitope peptide complexes, allowed for the 
identification and structural characterization of 'binding pockets' within the peptide 
binding cleft of HLA molecules. More refined epitope anchor motif definitions were then 
developed using data obtained from in vitro peptide-MHC binding assays. It is now well 
known that the main anchor residues typically occur at position 2 and the carboxyl 
terminus of peptides 8-1 1 amino acids in length, thus positions 8, 9, 10 or 1 1 (33-40). The 
definition of epitope peptide binding anchor motifs is the key to most, if not all, epitope 
prediction methods. 

[0049J Initial CTL epitope identification methods were developed using common HLA 

alleles, such as HLA-A2.1. Motifs defined using different HLA molecules were found to 
be similar and this lead to the definition of HLA supertype families (41). The biological 
effect of this supertype relationship was first demonstrated for HTV-1 epitopes in a study 
where the HLA-A3 and -All epitope peptide binding patterns repertoires were 
demonstrated to be overlapping, not only with each other but also with HLA- A3 1, -A33 
and -A*6801 (42). This binding specificity was defined as the HLA-A3 supertype. A 
significant overlap in peptide binding patterns was also demonstrated amongst several 
serologically distant HLA-B alleles (43, 44), and multiple HLA-A2 alleles (45, 46), 
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resulting in the definition of the HLA-B7 and HLA-A2 supertype families. Recognition of 
epitopes by CTL in a supertype manner has since been demonstrated to occur naturally in 
infectious diseases and cancer (47-53). 
[0050] While only two positions within CTL epitopes are typically characterized as the 

primary binding anchor positions, the amino acids that can serve as the anchor residues are 
more variable. The preferred and tolerated amino acids that can serve as anchor residues 
for the HLA-A2, -A3 and -B7 supertype families of epitopes are listed in Table 1. It is 
possible for analogous HTV-1 epitope peptides derived from different isolates, which 
differ with respect to the amino acids used as anchor residues, to bind to HLA molecules 
similarly. This type of variation can be as conserved since it is likely that CTL produced 
against one epitope would recognize the related epitope. Thus, variation limited to 
changes in anchor residues that result in sufficient epitope peptide binding to HLA 
molecules does not result in immune escape from CTL. Epitopes that contain this type of 
variation can be identified using the appropriately designed motif search algorithms. 
[0051] The TCR of CTL has been reported to be somewhat flexible or promiscuous with 

respect to recognition of epitope peptides bound to HLA molecules. For HTV-1, this 
flexibility was demonstrated as CTL recognition of related, but slightly variable, epitopes 
by single clones of CTL produced following natural infection (54, 55). Similar flexibility 
of CTL epitope recognition was demonstrated using rhesus macaques and natural infection 
with STV or immunization (56, 57). This observation is not unique to HTV-1 and STV but 
rather the TCR appears to have evolved to allow promiscuous recognition of peptide 
epitope bound to MHC molecules (58). 
[0052] Selective replacement of certain amino acids in CTL epitope peptides, amino acids 

thought to represent TCR contact points, is not only tolerated but can increase the 
recognition of the epitopes by CTL clones (59). The types of amino acid substitutions that 
can be incorporated, typically amino acids that are similar in chemical properties are best 
tolerated, and their positions, independent of primary anchor positions, within a selected 
number of CTL epitopes from tumor associated antigens were also defined. 
[0053] For HTV-1 and other infectious agents, reproducible methods for predicting the 

CTL recognition of related variant epitopes that occur amongst isolates have not been 
developed. Nor have methods for identifying CTL epitopes that are most likely to induce 
broadly functional responses when used in vaccine. Thus, there exists a need to develop 
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such methods to overcome the challenge associated with protein sequence variation in 
HIV and other infectious agents. 
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SUMMARY OF THE INVENTION 

[0054] The present invention is directed to methods for selecting a variant of a peptide 

epitope which induces a CTL response against another variant(s) of the peptide epitope, by 
determining whether the variant comprises only conserved residues, as defined herein, at 
non-anchor positions in comparison to the other variants). 

[0055] In some embodiments, antigen sequences from a population of an infectious agent, 

said antigens comprising variants of a peptide epitope, are optionally aligned (manually or 
by computer) along their length, preferably their full length. Variant(s) of a peptide 
epitope (preferably naturally occurring variants), each 8-11 amino acids in length and 
comprising the same MHC class I supermotif or motif, are identified manually or with the 
aid of a computer. In some embodiments, a variant is optionally chosen which comprises 
preferred anchor residues of said motif and/or which occurs with high frequency within 
the population of variants. In other embodiments, a variant is randomly chosen. The 
randomly or otherwise chosen variant is compared to from one to all the remaining 
variant(s) to determine whether it comprises only conserved residues in the non-anchor 
positions relative to from one to all the remaining variants). 
[0056] The present invention is also directed to variants identified by the methods above; 

peptides comprising such variants; nucleic acids encoding such variants and peptides; cells 
comprising such variants, and/or peptides, and/or nucleic acids; compositions comprising 
such variants, and/or peptides, and/or nucleic acids, and/or cells; as well as therapeutic and 
diagnostic methods for using such variants, peptides, nucleic acids, cells, and 
compositions. 
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BRIEF DESCRIPTION OF THE DRAWINGS/FIGURES 

[0057] FIGS. 1 A-1E. Recognition of variant peptides by CTL generated against a single 

epitope. Variant peptides were identified from 167 HIV strains for 5 HTV epitopes, 3 
HLA-A2 restricted (Env 134, A, Gag 386, B, and Vpr 62, C) and 2 HLA-A11 restricted 
(Pol 98, D, and Env 47, E). These are listed according to their relationship to a previously 
determined parent (P) into single anchor substitutions (A), single non-anchor substitutions 
(NA) or multiple substitutions (M). Binding of each variant peptide is also shown. The 
number of viral sequences containing each variant peptide is shown in the column labeled 
# Isolates, and is reported for the total sequences, Clade B sequences (B), and Clade C 
sequences (C). Finally, the ability of CTL primed against the parent peptide to recognize 
the variant peptides is shown in the bar graphs. 

[0058] FIGS. 2A-2C. Characterization of the peptide-specific T cell lines. A. FACS 

analysis of the TCRs expressed by peptide -stimulated cells after 0, 1, and 5 peptide 
stimulations, using a panel of commercially available mAb for mouse TCR 2-14. B-C. 
Peptide affinity. Parent and variant peptides were titrated against CTL that had been 
stimulated 5 times with the parent peptide. 

[0059] FIGS. 2A-2B. Recognition of a panel of variant peptides by PBL from an HIV- 

infected individual. 

[0060J FIG 4. Prediction of immunological conservation. Gag 271 variants and their 

binding are shown, along with the number of isolates that express each variant. 
Immunological recognition was predicted for each variant based on two different choices 
in the inununizing peptide. On the right, the immunogenicity for each variant is shown. 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

[0061] The invention can be better understood with reference to the following definitions: 

[0062] An "antigen" refers to a polypeptide encoded by the genome of an infectious agent, 

or other another source, but preferably an infectious agent in the present invention. 
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Examples of HIV antigens include Env, Gag, Nef, Pol, Tat, Rev, Vif, Vpr, Vpu, pl7, p24, 
p2, p7, pi, p6, Protease, RT, Integrase, and gpl60 (preferably Env, Gag, Nef, Pol, Tat, 
Rev, Vif, Vpr, Vpu). Examples of HBV antigens include Core, Env, and Pol. Examples 
of HCV antigens include Core, El, E2, Nsl, Ns2, Ns3, Ns4, and Ns5. Examples of HPV 
antigens include El, E2, E3, E4, E5, E6, E7, LI, and L2. Examples of Plasmodium 
falciparum antigens include CSP, SSP2, Expl, and LSAL 
[0063] Throughout this disclosure, "binding data" results are often expressed in terms of "KWs." 

ICso is the concentration of peptide in a binding assay at which 50% inhibition of binding of a 
reference peptide is observed. Given the conditions in which the assays are run (i.e., limiting HLA 
proteins and labeled peptide concentrations), these values approximate K D values. Assays for 
determining binding are described in detail, e.g. 9 in PCT publications WO 94/20127 and WO 
94/03205, and other publications such Sidney et al., Current Protocols in Immunology 18.3.1 
(1998); Sidney, et al., J. Immunol. 154:247 (1995); and Sette, et al, Mol. Immunol 31:813 (1994). 
It should be noted that IC 5 o values can change, often dramatically, if the assay conditions are 
varied, and depending on the particular reagents used (e.g., HLA preparation, etc.). For example, 
excessive concentrations of HLA molecules will increase the apparent measured IC 50 of a given 
ligand. 

[0064] Alternatively, binding is expressed relative to a reference peptide. Although as a particular 

assay becomes more, or less, sensitive, the IC 50 ! s of the peptides tested may change somewhat, the 
binding relative to the reference peptide will not significantly change. For example, in an assay 
run under conditions such that the IC 50 of the reference peptide increases 10-fold, the IC 50 values 
of the test peptides will also shift approximately 10-fold. Therefore, to avoid ambiguities, the 
assessment of whether a peptide is a good (i.e. high), intermediate, weak, or negative binder is 
generally based on its IC 50 , relative to the IC 50 of a standard peptide. The Tables included in this 
application present binding data in a preferred biologically relevant form of IC 50 nM. 

[0065] Binding may also be determined using other assay systems including those using: live 

cells (e.g. 9 Ceppellini et al, Nature 339:392 (1989); Christnick et al 9 Nature 352:67 (1991); Busch 
et al, Int. Immunol 2:443 (1990); Hill et al 9 J. Immunol 147:189 (1991); del Guercio et al 9 J. 
Immunol 154:685 (1995)), cell free systems using detergent lysates (e.g. 9 Cerundolo et al 9 J. 
Immunol 21:2069 (1991)), immobilized purified MHC (e.g. 9 Hill et al 9 J. Immunol 152, 2890 
(1994); Marshall et al 9 J. Immunol 152:4946 (1994)), ELISA systems (e.g. 9 Reay et al, EMBO J. 
11:2829 (1992)), surface plasmon resonance (e.g. 9 Khilko et al 9 J. Biol Chem. 268:15425 (1993)); 
high flux soluble phase assays (Hammer et al 9 J. Exp. Med. 180:2353 (1994)), and measurement 
of class I MHC stabilization or assembly (e.g. 9 Ljunggren et al 9 Nature 346:476 (1990); 
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Schumacher et al, Cell 62:563 (1990); Townsend et al, Cell 62:285 (1990); Parker et al, J. 
Immunol 149:1896(1992)). 
[0066J As used herein, "high affinity" with respect to HLA class I molecules is defined as binding 

with an ICso or K D value, of 50 nM or less, "intermediate affinity" is binding with an IC 50 or K D 
value of between 50 and about 500 nM, weak affinity is binding with an IC 50 or K D value of 
between about 500 and about 5000 nM. "High affinity" with repect to binding to HLA class H 
molecules is defined as binding with an IC S0 or K D value of 100 nM or less; "intermediate affinity- 
is binding with an IC S0 or K D value of between about 100 and about 1000 nM. 
[0067] A "computer" or "computer system" generally includes: a processor and related computer 

programs; at least one information storage/retrieval apparatus such as a hard drive, a disk drive or 
a tape drive; at least one input apparatus such as a keyboard, a mouse, a touch screen, or a 
microphone; and display structure, such as a screen or a printer. Additionally, the computer may 
include a communication channel in communication with a network. Such a computer may 
include more or less than what is listed above. 
[0068] "Cross-reactive binding" indicates that a peptide is bound by more than one HLA 

molecule; a synonym is degenerate binding. 
[0069] A "cryptic epitope" elicits a response by immunization with an isolated peptide, but the 

response is not cross-reactive in vitro when intact whole protein, which comprises the epitope, is 
used as an antigen. 

[0070] The term "derived" when used to discuss an epitope is a synonym for "prepared." A 

derived epitope can be isolated from a natural source, or it can be synthesized in accordance with 
standard protocols in the art. Synthetic epitopes can comprise artificial amino acids "amino acid 
mimetics," such as D isomers of natural occurring L amino acids or non-natural amino acids such 
as cyclohexylalanine. A derived/prepared epitope can be an analog of a native epitope. 

[0071] A "diluent" includes sterile liquids, such as water and oils, including those of petroleum, 

animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and 
the like. Water is a preferred diluent for pharmaceutical compositions. Saline solutions and 
aqueous dextrose and glycerol solutions can also be employed as diluents, particularly for 
injectable solutions. 

[0072] A "dominant epitope" is an epitope that induces an immune response upon immunization 

with a whole native antigen (see, e.g., Sercarz, et al., Annu. Rev. Immunol 11:729-766, 1993). 
Such a response is cross-reactive in vitro with an isolated peptide epitope. 

[0073] An "epitope" is the collective features , of a molecule, such as primary, secondary and 

tertiary peptide structure, and charge, that together form a site recognized by an immunoglobulin, 
T cell receptor or HLA molecule. Alternatively, an epitope can be defined as a set of amino acid 
residues which is involved in recognition by a particular immunoglobulin, or in the context of T 
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cells, those residues necessary for recognition by T cell receptor proteins and/or Major 
Histocompatibility Complex (MHC) receptors. Epitopes are present in nature, and can be 
isolated, purified or otherwise prepared/derived by humans. For example, epitopes can be 
prepared by isolation from a natural source, or they can be synthesized in accordance with standard 
protocols in the art. Synthetic epitopes can comprise artificial amino acids, "amino acid 
mimetics," such as D isomers of naturally-occurring L amino acids or non-naturally-occuring 
amino acids such as cyclohexylalanine. Throughout this disclosure, epitopes may be referred to in 
some cases as peptides. The variants of the invention are set forth in Tables 6-9 and Figures 1 A-4. 

[0074] It is to be appreciated that proteins or peptides that comprise a variant of the invention as 

well as additional amino acid(s) are still within the bounds of the invention. In certain 
embodiments, the peptide comprises a fragment of an antigen. A "fragment of an antigen" or 
"antigenic fragment" or simply "fragment" is a portion of an antigen which has 100% identity with 
a wild type antigen or naturally-ocurring variant thereof. The fragment may or may not comprise 
an epitope of the invention. The fragment may be less than or equal to 600 amino acids, less than 
or equal to 500 amino acids, less than or equal to 400 amino acids, less than or equal to 250 amino 
acids, less than or equal to 100 amino acids, less than or equal to 85 amino acids, less than or equal 
to 75 amino acids, less than or equal to 65 amino acids, or less than or equal to 50 amino acids in 
length. In certain embodiments, a fragment is e.g., less than 101 or less than 51 amino acids in 
length, in any increment down to 5 amino acids in length. For example, the fragment may be 5, 6, 
7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 
34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 
60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100 amino acids in length. 

[0075] In certain embodiments, there is a limitation on the length of a peptide of the invention. 

The embodiment that is length-limited occurs when the protein/peptide comprising an epitope of 
the invention comprises a region (i.e., a contiguous series of amino acids) having 100% identity 
with a native sequence. In order to avoid the definition of epitope from reading, e.g. 9 on whole 
natural molecules, there is a limitation on the length of any region that has 100% identity with a 
native peptide sequence. Thus, for a peptide comprising an epitope of the invention and a region 
with 100% identity with a native peptide sequence, the region with 100% identity to a native 
sequence generally has a length of: less than or equal to 600 amino acids, often less than or equal 
to 500 amino acids, often less than or equal to 400 amino acids, often less than or equal to 250 
amino acids, often less than or equal to 100 amino acids, often less than or equal to 85 amino 
acids, often less than or equal to 75 amino acids, often less than or equal to 65 amino acids, and 
often less than or equal to 50 amino acids. In certain embodiments, an "epitope" of the invention 
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is comprised by a peptide having a region wilh less than 51 amino acids that has 100% identity to a 
native peptide sequence, in any increment down to 5 amino acids. 
[0076] Accordingly, peptide or protein sequences longer than 600 amino acids are within the 

scope of the invention, so long as they do not comprise any contiguous sequence of more than 600 
amino acids that have 100% identity with a native peptide sequence. For any peptide that has five 
contiguous residues or less that correspond to a native sequence, there is no limitation on the 
maximal length of that peptide in order to fall within the scope of the invention. It is presently 
preferred that a peptide of the invention (eg., a peptide comprising an epitope of the invention) be 
less man 600 residues long in any increment down to eight amino acid residues. 
[0077] A peptide epitope occurring with "high frequency" is one that occurs in at least 

30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, or at least 90% of 
the infectious agents in a population. A "high frequency" peptide epitope is one of the 
more common in a population, preferably the first most common, second most common, 
third most common, or fourth most common in a population of variant peptide epitopes. 
[0078] "Human Leukocyte Antigen" or "HLA" is a human class I or class H Major 

Histocompatibility Complex (MHC) protein (see, e.g., States, et al, IMMUNOLOGY, 8™ Ed., Lange 
Publishing, Los Altos, CA (1 994). 
[0079] An "HLA supertype or HLA family", as used herein, describes sets of HLA molecules 

grouped on the basis of shared peptide-binding specificities. HLA class I molecules that share 
somewhat similar binding affinity for peptides bearing certain amino acid motifs are grouped into 
such HLA supertypes. The terms HLA superfamily, HLA supertype family, HLA family, and 
HLA xx-like molecules (where «xx» denotes a particular HLA type), are synonyms. See Tables 1- 
4. 

[0080] As used herein, "high affinity" with respect to HLA class I molecules is defined as binding 

with an ICjo, or K D value, of 50 nM or less; "intermediate affinity" is binding with an IC 50 or K D 
value of between about 50 and about 500 nM; "weak affinity" is binding with an IC 50 or K D value 
between about 500 and about 5000 nM. "High affinity" with respect to binding to HLA class H 
molecules is defined as binding with an IC 50 or K D value of 100 nM or less; "intermediate affinity" 
is binding with an IC 50 or K D value of between about 100 and about 1 000 nM. See "binding data." 

[0081] An "ICso" is the concentration of peptide in a binding assay at which 50% inhibition of 

binding of a reference peptide is observed. Given ihe conditions in which the assays are run (i.e., 
limiting HLA proteins and labeled peptide concentrations), these values approximate K D values. 
See "binding data." 

[0082] The terms "identical" or percent "identity," in the context of two or more peptide 

sequences or antigen fragments, refer to two or more sequences or subsequences that are the same 
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or have a specified percentage of amino acid residues that are the same, when compared and 
aligned for maximum correspondence over a comparison window, as measured using a sequence 
comparison algorithm or by manual alignment and visual inspection. 

[0083] An "immunogenic" peptide or an "immunogenic" epitope or "peptide epitope" is a peptide 

that comprises an allele-specific motif or supermotif suclv that the peptide will bind an HLA 
molecule and induce a CTL and/or HTL response. Thus, immunogenic peptides of the invention 
are capable of binding to an appropriate HLA molecule and thereafter inducing a cytotoxic T 
lymphocyte (CTL) response, or a helper T lymphocyte (HTL) response, to the peptide. 

[0084] An "infectious agent" refers to a disease-causing microorganism, including viruses, 

bacteria, fungi, and protozoa against which a cellular immune response, preferably a CTL 
response, plays a role in acquired immunity. Examples of infectious agents include 
viruses such as human immunodeficiency virus (HIV), hepatitis B virus (HBV), hepatitis 
C virus (HCV), human papillomma virus (HPV), Influenza virus, Dengue virus, Epstein- 
Barr virus, bacteria such as Mycobacterium tuberculosis and Chlamydia, fungi such as Candida 
albicans, Cryptococcus neoformans, Coccidoides spp., Histoplasma spp, and Aspergillus 
fumigatis, protozoa such as Plasmodium spp,, including P. falciparum, Trypanosoma spp., 
Schistosoma spp., Leishmania spp and the like. Preferred infectious agents include HIV, HBV, 
HCV, HPV 5 Epstein-Barr virus, Plasmodium falciparum, Influenza virus and Dengue 
virus. 

[0085] The phrases "isolated" or "biologically pure" refer to material which is substantially or 

essentially free from components which normally accompany the material as it is found in its 
native state. Thus, isolated peptides in accordance with the invention preferably do not contain 
materials normally associated with the peptides in their in situ environment. An "isolated" epitope 
refers to an epitope that does not include the whole sequence of the antigen or polypeptide from 
which the epitope was derived. Typically the "isolated" epitope does not have attached thereto 
additional amino acids that result in a sequence that has 100% identity with a native sequence. 
The native sequence can be a sequence such as a tumor-associated antigen from which the epitope 
is derived. Thus, the term "isolated" means that the material is removed from its original 
environment {e.g., the natural environment if it is naturally occurring). For example, a naturally- 
occurring polynucleotide or peptide present in a living animal is not isolated, but the same 
polynucleotide or peptide, separated from some or all of the coexisting materials in the natural 
system, is isolated. Such a polynucleotide could be part of a vector, and/or such a polynucleotide 
or peptide could be part of a composition, and still be "isolated" in that such vector or composition 
is not part of its natural environment. Isolated RNA molecules include in vivo or in vitro RNA 
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transcripts of the DNA molecules of the present invention, and further include such molecules 
produced synthetically. 

[0086] "Major Histocompatibility Complex" or "MHC" is a cluster of genes that plays a role in 

control of the cellular interactions responsible for physiologic immune responses, hi humans, the 
MHC complex is also known as the human leukocyte antigen (HLA) complex. For a detailed 
description of the MHC and HLA complexes, see, Paul, FUNDAMENTAL IMMUNOLOGY, 3 rd 
Ed., Raven Press, New York (1993). 

[0087] The term "motif refers to a pattern of residues in an amino acid sequence of 

defined length, preferably a peptide of less than about 15 amino acids in length, or less 
than about 13 amino acids in length, usually from about 8 to about 13 amino acids (e.g., 8, 
9, 10, 11, 12, or 13) for a class I HLA motif and from about 6 to about 25 amino acids 
(e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25) for a class H 
HLA motif, which is recognized by a particular HLA molecule. Motifs are typically 
different for each HLA protein encoded by a given human HLA allele. These motifs often 
differ in their pattern of the primary and secondary anchor residues. See Tables 1-3. 

[0088] A "native" or a "wild type" sequence refers to a sequence found in nature. 

[0089] A "negative binding residue" or "deleterious residue" is an amino acid which, if present at 

certain positions (typically not primary anchor positions) in a peptide epitope, results in decreased 
binding affinity of the peptide for the peptide's corresponding HLA molecule. 

[0090] The term "peptide" is used interchangeably with "oligopeptide" in the present specification 

to designate a series of residues, typically L-amino acids, connected one to the other, typically by 
peptide bonds between the a-amino and carboxyl groups of adjacent amino acids. 

[0091] A "PanDR binding" peptide or "PADRE®" peptide (Epimmune, San Diego, CA) is a 

member of a family of molecules that binds more than one HLA class II DR molecule. The 
pattern that defines the PADRE® family of molecules can be referred to as an HLA Class II 
supermotif. A PADRE® molecule binds to HLA-DR molecules and stimulates in vitro and in vivo 
human helper T lymphocyte (HTL) responses. For a further definition of the PADRE® family, see 
copending application US serial Nos. 09/709,774, filed November 11, 2000; and 09/707,738, filed 
November 6, 2000; PCT publication Nos WO 95/07707, and WO 97/26784; U.S. Patent Nos. 
5,736,142 issued April 7, 1998; 5,679,640, issued October 21, 1997; and 6,413,935, issued July 2, 
2002. 

[0092] "Phannaceutically acceptable" refers to a generally non-toxic, inert, and/or physiologically 

compatible composition or component of a composition. 
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[0093] A "pharmaceutical excipient" or "excipient" comprises a material such as an adjuvant, a 

carrier, pH-adjusting and buffering agents, tonicity adjusting agents, wetting agents, preservatives, 
and the like. A "pharmaceutical excipient" is an excipient which is phannaceutically acceptable. 

[0094] A "primary anchor residue" is an amino acid at a specific position along a peptide 

sequence which is understood to provide a contact point between the immunogenic peptide and the 
HLA molecule. One, two or three, primary anchor residues within a peptide of defined length 
generally defines a "motif" for an immunogenic peptide. These residues are understood to fit in 
close contact with peptide binding grooves of an HLA molecule, with their side chains buried in 
specific pockets of the binding grooves themselves. In one embodiment of an HLA class I motif, 
the primary anchor residues are located at position 2 (from the amino terminal position) and at the 
carboxyl terminal position of a peptide epitope in accordance with the invention. The primary 
anchor positions for each motif and supermotif of HLA Class I are set forth in Tables 1-2. For 
example, analog peptides can be created by altering the presence or absence of particular residues 
in these anchor positions. Such analogs are used to modulate the binding affinity of an epitope 
comprising a particular motif or supermotif. A "preferred primary anchor residue" is an 
anchor residue of a motif or supermotif that is associated with optimal binding. Preferred 
primary anchor residues are indicated in bold-face in Tables 1-2. A tolerated primary 
anchor residue" is an anchor residue of a motif or supermotif that is associated with 
binding to a lesser extent than a preferred residue. Tolerated primary anchor residues are 
indicated in italicized text in Tables 1-2. 

[0095] "Promiscuous recognition" by a TCR is where a distinct peptide is recognized by the 

various T cell clones in the context of various HLA molecules. Promiscuous binding by an HLA 
molecule is synonymous with cross-reactive binding. 

[0096] A "protective immune response" or 'therapeutic immune response" refers to a CTL and/or 

an HTL response to an antigen derived from an antigen of an infectious agent, which in some way 
prevents or at least partially arrests disease symptoms, side effects or progression. The immune 
response may also include an antibody response which has been facilitated by the stimulation of 
helper T cells. 

[0097] By "ranking 55 the variants in a population of peptide epitopes is meant ordering 

each variant by its frequency of occurrance relative to the other variants. 

[0098] The term "residue" refers to an amino acid or amino acid mimetic incorporated into a 

peptide or protein by an amide bond or amide bond mimetic. 

[0099] A "secondary anchor residue" is an amino acid at a position other than a primary anchor 

position in a peptide which may influence peptide binding. A secondary anchor residue occurs at a 
significantly higher frequency amongst HLA-bound peptides than would be expected by random 
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distribution of anuno acids at a given position. A secondary anchor residue can be identified as a 
residue which is present at a higher frequency among high or intermediate affinity binding 
peptides, or a residue otherwise associated with high or intermediate affinity binding The 
secondary anchor residues are said to occur at "secondary anchor positions." For example, analog 
peptides can be created by altering the presence or absence of particular residues in these 
secondary anchor positions. Such analogs are used to finely modulate the binding affinity of an 
epitope comprising a particular motif or supermotif. The terminology "fixed peptide" is generally 
used to refer to an analog peptide that has changes in primary anchore position; not secondary 
[00100, A "subdominant epitope" is an epitope which evokes little or no response upon 

nnmunization with a whole antigen or a fragment of the whole antigen comprising a subdominant 
epitope and a dominant epitope, which comprise the epitope, but for which a response can be 
obtamed by immunization with an isolated peptide, and this response (unlike the case of cryptic 
epitopes) is detected when whole antigen or a fragment of the whole antigen comprising a 
subdommant epitope and a dominant epitope is used to recall the response in vitro or in vivo 
[00101] A "supermotif is a peptide binding specificity shared by HLA molecules encoded by two 

or more HLA alleles. Preferably, a supeimotif-bearing peptide is recognized with high or 
intermediate affinity (as defined herein) by two or more HLA antigens 
[00102, "Synthetic peptide" refers to a peptide that is abtained from a non-natural source e s . is 

man-made. Such peptides may be produced using such methods as chemical synthesis'or 
recombmant DNA technology. "Synthetic peptides" include "fusion proteins » 
[00103, As used herein, a "vaccine" is a composition used for vaccination, e.g., for prophylaxis or 

therapy, that comprises one or more peptides of the invention. There are numerous embodiments 
of vaccines in accordance with the invention, such as by a cocktail of one or more peptides- one or 
more peptides of the invention comprised by a polyepitopic peptide; or nucleic acids that encode 
such peptides or polypeptides, e. g ., a minigene that encodes a polyepitopic peptide. The "one or 
more peptides" can include any whole unit integer from 1-150, e.g., at least 1, 2, 3, 4, 5 6 7 8 9 
10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33,' 34 35 
36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60,' 65 
70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, or 150 or more peptides of 
the mventron. The peptides or polypeptides can optionally be modified, such as by lipidation 
addition of targeting or other sequences. HLA class I-binding peptides of the invention can be 
hnked to HLA class H-binding peptides, e.g., a PADRE® universal HTL-bindind peptide to ■ 
facilitate activation of both cytotoxic T lymphocytes and helper T lymphocytes. Vaccines can 
comprise peptide pulsed antigen presenting cells, e.g., dendritic cells. 
[00104, A 'Variant of a peptide epitope" refers to a peptide that is identified from a 

different viral strain at the same position in an aligned sequence, and that varies by one or 
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more amino acids from the parent peptide epitope. Examples of peptide epitope variants 
include those shown in Tables 6-9 and Figures 1A-4. A 'Variant of an antigen" refers to 
an antigen that comprises at least one variant of a peptide epitope. Examples of antigen 
variants include those listed by sequence and/or accession number in Tables 10-22. A 
•Variant of an infectious agent" refers to an infectious agent whose genome encodes at 
least one variant of an antigen. Variants of infectious agents are related viral, bacterial, 
fimagl, or protozoan strains or isolates that vary in sequence but cause the same disease 
symptoms. Examples of infectious agent variants include HTV Clade A, B, and C 
subtypes, HBV subtypes adr, ayr, adw, and ayw, HCV types 1, 2, 3, 4, 5, and 6, HPV 
strains 1-92 (preferably strains 16, 18, 31, 33, 45, 52, 56, and 58) (see Table 10, listing 
accession numbers for the complete genome sequences of 167 HTV variants; Table 22, 
showing an alignment of the complete polyprotein sequences of 50 HCV variants) (see 
also, Human Retroviruses and AIDS 2000: A Compilation and Analysis of Nucleic Acid 
and Amino Acid Sequences, Kuiken CL, et al., Eds. Theoretical Biology and Biophysics 
Group, Los Alamos National Laboratory, Los Alamos, NM). 
[00105] The nomenclature used to describe peptides/proteins follows the conventional practice 

wherein the amino group is presented to the left (the N-terminus) and the carboxyl group to the 
right (the C-tenninus) of each amino acid residue. When amino acid residue positions are referred 
to in a peptide epitope they are numbered in an amino to carboxyl direction with position one 
being the position closest to the amino terminal end of the epitope, or the peptide or protein of 
which it may be a part. In the formulae representing selected specific embodiments of the present 
invention, the amino- and carboxyl-tenninal groups, although not specifically shown, are in the 
form they would assume at physiologic pH values, unless otherwise specified. Li the amino acid 
structure formulae, each residue is generally represented by standard three letter or single letter 
designations. The L-form of an amino acid residue is represented by a capital single letter or a 
capital first letter of a three-letter symbol, and the D-form for those amino acids having D-forms is 
represented by a lower case single letter or a lower case three letter symbol. However, when three 
letter symbols or full names are used without capitals, they may refer to L amino acids. Glycine 
has no asymmetric carbon atom and is simply referred to as "Gly" or "G". The amino acid 
sequences of peptides set forth herein are generally designated using the standard single letter 
symbol. (A Alanine; C, Cysteine; D, Aspartic Acid; E, Glutamic Acid; F, Phenylalanine; G, 
Glycine; H, Histidine; I, Isoleucine; K, Lysine; L, Leucine; M, Methionine; N, Asparagine; P, 
Proline; Q, Glutamine; R, Arginine; S, Serine; T, Threonine; V, Valine; W, Tryptophan; and Y, 
Tyrosine.) In addition to these symbols, «B"in the single letter abbreviations used herein 
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designates a-amino butyric acid. Li some embodiments, a-amino butyric acid may be replaced 
with cysteine. 



APC 
CD3 
CD4 
CD8 
CEA 
CTL 
DC: 



DLT: 

DMSO: 

ELISA: 

E:T: 

G-CSF: 

GM-CSF: 

HBV: 

HER2/neu: 

HLA: 

HLA-DR: 

HPLC: 

HTC: 

HTL: 

ID: 

EFNy: 

IL-4: 

IV: 

LU 30 %: 

MAb: 

MAGE: 

MLR: 

MNC: 

PB: 

PBMC: 
ProGP™: 

SC: 

S.E.M.: 

QD: 

TAA: 

TNF: 

WBC: 



Acronyms used herein are as follows: 

Antigen presenting cell 

Pan T cell marker 

Helper T lymphocyte marker 

Cytotoxic T lymphocyte marker 

Carcinoembryonic antigen (see, e.g., SEQ ID NO: 363) 

Cytotoxic T lymphocyte 

Dendritic cells. DC functioned as potent antigen presenting cells by stimulating 
cytokine release from CTL lines that were specific for a model peptide derived 
from hepatitis B virus. In vivo experiments using DC pulsed ex vivo with an HBV 
peptide epitope have stimulated CTL immune responses in vivo following delivery 
tonaiVemice. 

Dose-limiting toxicity, an adverse event related to therapy. 

Dimethylsulfoxide 

Enzyme-linked immunosorbant assay 

EffectonTarget ratio 

Granulocyte colony-stimulating factor 

Granulocyte-macrophage (monocyte)-colony stimulating factor 
Hepatitis B virus 

A tumor associated antigen; c-erbB-2 is a synonym (see, e.g., SEQ ID NO: 364) 

Human leukocyte antigen 

Human leukocyte antigen class II 

High Performance Liquid Chromatography 

Helper T Cell 

Helper T Lymphocyte. A synonym for HTC. 
Identity 

Interferon gamma 
Interleukin-4 , 
Intravenous 

Cytotoxic activity for 10 6 effector cells required to achieve 30% lysis of a target 
cell population, at a 100:1 (E:T) ratio. 
Monoclonal antibody 

Melanoma antigen (see, e.g., SEQ ID NO: 365 and 366 for MAGE2 and MAGE3) 
Mixed lymphocyte reaction 
Mononuclear cells 
Peripheral blood 

Peripheral blood mononuclear cell 

Progenipoietin™product (Searle, St. Louis, MO), a chimeric flt3/G- 

CSF receptor agonist 

Subcutaneous 

Standard error of the mean 

Once a day dosing 

Tumor Associated Antigen 

Tumor necrosis factor 

White blood cells 
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[00106] The following describes the peptides, nucleic acid molecules, compositions, and 
methods of the invention in more detail. 

i 

Methods of Identifying Candidate Peptide Epitopes 



[00107] The present invention is directed to methods for selecting a variant of a peptide 

epitope which induces a CTL response against another variant(s) of the peptide epitope, by 
detennining whether the variant comprises only conserved residues, as defined herein, at 
non-anchor positions in comparison to the other variant(s). 

[00108] In some embodiments, antigen sequences from a population of an infectious agent, 
said antigens comprising variants of a peptide epitope, are optionally aligned (manually or 
by computer) along their length, preferably their full length. Variant(s) of a peptide 
epitope (preferably naturally occurring variants), each 8-11 amino acids in length and 
comprising the same MHC class I supermotif or motif, are identified manually or with the 
aid of a computer. In some embodiments, a variant is optionally chosen which comprises 
preferred anchor residues of said motif and/or which occurs with high frequency within 
the population of variants. In other embodiments, a variant is randomly chosen. The 
randomly or otherwise chosen variant is compared to from one to all the remaining 
variant(s) to determine whether it comprises only conserved residues in the non-anchor 
positions relative to from one to all the remaining variant(s). 

[00109] The present invention is also directed to variants identified by the methods above; 

peptides comprising such variants; nucleic acids encoding such variants and peptides; cells 
comprising such variants, and/or peptides, and/or nucleic acids; compositions comprising 
such variants, and/or peptides, and/or nucleic acids, and/or cells; as well as therapeutic and 
diagnostic methods for using such variants, peptides, nucleic acids, cells, and 
compositions. 

[00110] In some embodiments, the invention is directed to a method for identifying a 
candidate peptide epitope which induces a HLA class I CTL response against variants of 
said peptide epitope, comprising 

a) identifying, from a particular antigen of an infectious agent, variants of 
a peptide epitope 8-1 1 amino acids in length, each variant comprising 
primary anchor residues of the same HLA class I binding motif; and 
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b) determining whether one of said variants comprises only conserved 
non-anchor residues in comparison to at least one remaining variant, 
thereby identifying a candidate peptide epitope. 

[00111] Jh some embodiments, (b) comprises identifying a variant which comprises only 
conserved non-anchor residues in comparison to at least 25%, at least 50%, at least 75%, 
at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, or at least 99% of the 
remaining variants. 

[00112] In some embodiments, the invention is directed to a method for identifying a 
candidate peptide epitope which induces a HLA class I CTL response against variants of 
said peptide epitope, comprising 

a) identifying, from a particular antigen of an infectious agent, variants of 
a peptide epitope 8-1 1 amino acids in length, each variant comprising 
primary anchor residues of the same HLA class I binding motif; 

b) determining whether each of said variants comprises conserved, semi- 
conserved or non-conserved non-anchor residues in comparison to each 
of the remaining variants; and 

c) identifying a variant which comprises only conserved non-anchor 
residues in comparison to at least one remaining variant. 

[00113] In some embodiments, (c) comprises identifying a variant which comprises only 

conservative non-anchor residues in comparison to at least 25%, at least 50%, at least 
. 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, or at least 99% 

of the remaining variants. 
[00114] In some embodiments, the invention is directed to a method for identifying a 

candidate peptide epitope which induces a HLA class I CTL response against variants of 

said peptide epitope, comprising 

a) identifying, from a particular antigen of an infectious agent, a 
population of variants of a peptide epitope 8-1 1 amino acids in length, 
each peptide epitope comprising primary anchor residues of the same 
HLA class I binding motif; 

b) choosing a variant selected from the group consisting of: 

i) a variant which comprises preferred primary anchor residues 
of said motif; and 
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ii) a variant which occurs with high frequency within the 
population of variants; and 
c) determining whether the variant of (b) comprises only conserved non- 
anchor residues in comparison to at least one remaining variant, thereby 
identifying a candidate peptide epitope. 

[00115] In some embodiments, (c) comprises identifying a variant which comprises only 
conservative non-anchor residues in comparison to at least 25%, at least 50%, at least 
75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, or at least 99% 
of the remaining variants. 

[00116] In some embodiments, the invention is directed to method for identifying a 

candidate peptide epitope which induces a HLA class I CTL response against variants of 
said peptide epitope, comprising 

a) identifying, from a particular antigen of an infectious agent, a 
population of variants of a peptide epitope 8-1 1 amino acids in length, 
each peptide epitope comprising primary anchor residues of the same 
HLA class I binding motif; 

b) choosing a variant selected from the group consisting of: 

i) a variant which comprises preferred primary anchor residues of 
said motif; and 

ii) a variant which occurs with high frequency within the 
population of variants; and 

c) determining whether the variant of (b) comprises conserved, semi- 
conserved or non-conserved non-anchor residues in comparison to each 
of the remaining variants; and 

d) identifying a variant which comprises only conserved non-anchor 
residues in comparison to at least one remaining variant. 

[00117] In some embodiments, (d) comprises identifying a variant which comprises only 
conservative non-anchor residues in comparison to at least 25%, at least 50%, at least 
75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 97%, or at least 99% 
of the remaining variants. 

[00118] In some embodiments, (a) comprises aligning the sequences of said antigens. 
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[00119] In some embodiments, (b) comprises comprises choosing a variant which 

comprises preferred primary anchor residues of said motif. 
[00120] In some embodiments, (b) comprises comprises choosing a variant which occurs 

with high frequency within said population. 
[00121] hi some embodiments, (b) comprises ranking said variants by frequency of 

occurrence within said population. 
[00122] In some embodiments, (b) comprises choosing a variant which comprises 

preferred primary anchor residues of said motif and which occurs with high frequency 

within said population. 

[00123] In some embodiments, (b) comprises ranking said variants by frequency of 

occurrence within said population. 
[00124] In some embodiments, the identified variant comprises the fewest conserved 

anchor residues in comparison to each of the remaining variants. 
[00125] In some embodiments, the remaining variants comprise 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 

11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 27, 28, 30, 35, 40, 45, 50, 

55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 

220, 240, 260, 280, or 300 variants. 
[00126] In some embodiments, the infectious agent is selected from the group consisting 

of: HTV, HBV, HCV, HPV, Plasmodium falciparum, Influenza virus, and Dengue virus, 

Epstein-Barr virus, Mycobacterium tuberculosis, Chlamydia, Candida albicans, Cryptococcus 

neofortnans, Coccidoides spp., Histoplasma spp, Aspergillus fumigatis, Plasmodium spp., 

Trypanosoma spp, Schistosoma spp., and Leishmania spp. 
[00127] In some embodiments, the infectious agent is selected from the group consisting 

of: HIV , HBV, HCV, HPV, Plasmodium falciparum, Influenza virus, and Dengue virus. 
[00128] In some embodiments, the infectious agent is HTV and the antigen is selected from 

the group consisting of: Gag, Env, Pol, Nef, Rev, Tat, Vif, Vpr, and Vpu. 
[00129] In some embodiments, the infectious agent is HBV and the antigen is selected from 

the group consisting of: Pol, Env, Core, and NSl/Env2. 
[00130] In some embodiments, the infectious agent is HCV and the antigen is selected from 

the group consisting of: Core, El, E2, NS1, NS2, NS3, NS4, and NS5. 
[00131] In some embodiments, the infectious agent is HPV and the antigen is selected from 

the group consisting of: El, E2, E3, E4, E5, E6, E7, LI, and L2. 
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[00132] In some embodiments, the infectious agent is Plasmodium falciparum and the 
antigen is selected from the group consisting of: CSP, SSP2, EXPl, LSA1. 

[00133] In some embodiments, the selected variant and the at least one remaining variant 
comprise different primary anchor residues of the same motif or supermotif. 

[00134] In some embodiments, the motif or supermotif is selected from the group 
consisting of those in Tables 1-2. 

[00135] In some embodiments, the conserved non-anchor residues are at any of positions 3- 
7 of said variant. 

[00136] In some embodiments, the variant comprises only 1-3 conserved non-anchor 

residues compared to at least one remaining variant. 
[00137] In some embodiments, the variant comprises only 1-2 conserved non-anchor 

residues compared to at least one remaining variant. 
[00138] In some embodiments, the variant comprises only 1 conserved non-anchor residue 

compared to at least one remaining variant. 
[00139] In some embodiments, the infectious agent is HPV, and further wherein, the HPV 

infectious agent is selected from the group consisting of HPV strains 16, 18, 31, 33, 45, 

52, 56, and 58. 

[00140] In some embodiments, the variants are a population of naturally occurring variants. 

[00141] Optional Alignment. Optionally, antigen sequences, either full-length or partial, 

may be aligned mannually or by computer. Convenient computer programs for aligning 
multiple sequences include Omiga, Oxford software, version 1.1.3, using ClustalW 
alignment, using an open gap penalty of 10.0, extend gap penalty of 0.05, and delay 
divergent sequences of 40.0 (See, e.g., Table 21); and BLASTP 2.2.5 (Nov-16-2002) 
(Altschul, S.F., et al., Nucleic Acids Res. 25:3389-3402 (1997)) using a cutoff = 3e-88 (to 
select human sequences) (see, e.g., Table 20). Alternatively, alignments may be obtained 
through publicly available sources such as published journal articles and published patent 
documents or as disclosed herein (see, e.g., Tables 10-22). 

[00142] HLA Class I Motifs Indicative of CTL Inducing Peptide Epitopes. A large 
fraction of HLA class I and class H molecules can be classified into a relatively few 
supertypes, each respective supertype characterized by largely overlapping peptide 
binding repertoires, and consensus structures of the main peptide binding pockets. Thus, 
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peptides of the present invention are preferably identified by the primary residues of any 
one of several HLA-specific amino acid motifs, or if the presence of the motif corresponds 
to the ability to bind several allele-specific HLA antigens, a supermotif {see, e.g., Tables 
1-2). The preferred primary residues are indicated in bold, while the tolerated primary 
residues are indicated by italics. 
[00143] The primary anchor residues of the HLA class I peptide epitope supermotifs and 
motifs are summarized in Tables 1-2. Preferred primary anchors are shown in bold, while 
tolerated primary anchors are shown in italics. Primary and secondary anchor positions 
for HLA Class I are summarized in Table 3. Allele-specific HLA molecules that fall 
within the various HLA class I supertypes are listed in Table 4. In some cases, patterns of 
amino acid residues are present in both a motif and a supermotif. The relationship of a 
particular motif and any related supermotif is indicated in the description of the individual 
motifs. 

[00144] Thus, the peptide motifs and supermotifs described below, and summarized in 

Tables 1-2, provide guidance for the identification and use of peptide epitopes comprising 
primary anchor residues of motifs or supermotifs in accordance with the invention. 

[00145] Allele-specific HLA molecules that comprise HLA class I supertype families are 
listed in Table 4. 

[00146] HLA-A1 supermotif. The HLA-A1 supermotif is characterized by the presence in 
peptide ligands of a small (T or S) or hydrophobic (L, I, V, or M) primary anchor residue 
in position 2, and an aromatic (Y, F, or W) primary anchor residue at the C-terminal 
position of the epitope. The corresponding family of HLA molecules that bind to the Al 
supermotif (i.e., the HLA-A1 supertype) is comprised of at least A*0101, A*2601, 
A*2602, A*2501, and A*3201 {see, e.g., DiBrino, M. et al, J. Immunol. 151:5930, 1993; 
DiBrino, M. et al, J. Immunol 152:620, 1994; Kondo, A. et al, Immunogenetics 45:249, 
1997). Other allele-specific HLA molecules predicted to be members of the Al 
superfamily are shown in Table 4. Peptides binding to each of the individual HLA 
proteins can be modulated by substitutions at primary and/or secondary anchor positions, 
preferably choosing respective residues specified for the supermotif. 

[00147] HLA-A2 supermotif. Primary anchor specificities for allele-specific HLA-A2.1 
molecules (see, e.g., Falk et al, Nature 351:290-296, 1991; Hunt et al, Science 255:1261- 
1263, 1992; Parker et al, J. Immunol 149:3580-3587, 1992; Ruppert et al, Cell 74:929- 
937, 1993) and cross-reactive binding among HLA-A2 and -A28 molecules have been 
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described. (See, e.g., Fruci et al 9 Human Immunol 38:187-192, 1993; Tanigaki et al, 
Human Immunol 39:155-162, 1994; Del Guercio et al, J. Immunol 154:685-693, 1995; 
Kast et al, J. Immunol 152:3904-3912, 1994 for reviews of relevant data.) These primary 
anchor residues define the HLA-A2 supermbtif; which presence in peptide ligands 
corresponds to the ability to bind several different HLA-A2 and -A28 molecules. The 
HLA-A2 supennotif comprises peptide ligands with L, I, V, M, A, T, or Q as a primary 
anchor residue at position 2 and L, I, V, M, A, or T as a primary anchor residue at the C- 
terminal position of the epitope. 

[00148] The corresponding family of HLA molecules (i.e., the HLA-A2 supertype that 

binds these peptides) is comprised of at least: A*0201, A*0202, A*0203, A*0204, 
A*0205, A*0206, A*0207, A*0209, A*0214, A*6802, and A*6901. Other allele-specific 
HLA molecules predicted to be members of the A2 superfamily are shown in Table 4. As 
explained in detail below, binding to each of the individual allele-specific HLA molecules 
can be modulated by substitutions at the primary anchor and/or secondary anchor 
positions, preferably choosing respective residues specified for the supermotif. 

[00149] The motifs comprising the primary anchor residues V, A, T, or Q at position 2 and 

L, I, V, A, or T at the C-terminal position are those most particularly relevant to the 
invention claimed herein. 

[00150] HLA-A3 supermotif. The HLA-A3 supermotif is characterized by the presence in 

peptide ligands of A, L, I, V, M, S, or, T as a primary anchor at position 2, and a positively 
charged residue, R or K, at the C-terminal position of the epitope, e.g., in position 9 of 9- 
mers (see, e.g., Sidney et al, Hum. Immunol 45:79, 1996). Exemplary members of the 
corresponding family of HLA molecules (the HLA-A3 supertype) that bind the A3 
supermotif include at least A*0301, A*1101, A*3101, A*3301, and A*6801. Other allele- 
specific HLA molecules predicted to be members of the A3 supertype are shown in Table 
4. As explained in detail below, peptide binding to each of the individual allele-specific 
HLA proteins can be modulated by substitutions of amino acids at the primary and/or 
secondary anchor positions of the peptide, preferably choosing respective residues 
specified for the supermotif. 

[00151] HLA-A24 supermotif. The HLA-A24 supermotif is characterized by the presence 
in peptide ligands of an aromatic (F, W, or Y) or hydrophobic aliphatic (L, I, V, M, or T) 
residue as a primary anchor in position 2, and Y, F, W, L, I, or M as primary anchor at the 
C-terminal position of the epitope (see, e.g., Sette and Sidney, Immunogenetics, in press, 
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1999). The corresponding family of HLA molecules that bind to the A24 supennotif (i e 
the A24 supertype) includes at least A*2402, A*3001, and A*2301. Other allele-specific 
HLA molecules predicted to be members of the A24 supertype are shown in Table 4 
Peptide binding to each of the allele-specific HLA molecules can be modulated by 
substxtutions at primary and/or secondary anchor positions, preferably choosing respective 
residues specified for the supennotif. 
[00152] HLA-B7 supermotif. The HLA-B7 supennotif is characterized by peptides 
bearing proline in position 2 as a primary anchor, and a hydrophobic or aliphatic amino 
acid (L, I, V, M, A, F, W, or Y) as the primary anchor at the C-terminal position of the 
epitope. The corresponding family of HLA molecules that bind the B7 supermotif (i e 
the HLA-B7 supertype) is comprised of at least twenty six HLA-B proteins including- 
B*0702, B*0703, B*0704, B*0705, B*1508, B*3501, B*3502, B*3503, B*3504 B*350 5 ' 
B*3506, B*3507, B*3508, B*5101, B*5102, B*5103, B*5104, B*5105, B*530i'b*540i' 
B*5501, B*5502, B*5601, B*5602, B*6701, and B*7801 (see, e. g ., Sidney, \r al, J 
Immunol 154:247, 1995; Barber, et al, Curr. Biol 5:179, 1995; Hill, et al, Nature 
360:434, 1992; Rammensee, et al, Immunogenetics 41:178, 1995 for reviews of relevant 
data). Other allele-specific HLA molecules predicted to be members of the B7 supertype 
are shown in Table 4. As explained in detail below, peptide binding to each of the 
individual allele-specific HLA proteins can be modulated by substitutions at the primary 
and/or secondary anchor positions of the peptide, preferably choosing respective residues 
specified for the supermotif. 

100153] HLA-B27 supermotif. The HLA-B27 supermotif is characterized by the presence 
in peptide ligands of a positively charged (R, H, or K) residue as a primary anchor at 
position 2, and a hydrophobic (F, Y, L, W, M, I, A, or V) residue as a primary anchor at 
the C-terminal position of the epitope (see, e.g., Sidney and Sette, Immunogenetics in 
press, 1999). Exemplary members of the corresponding family of HLA molecules that 
bind to the B27 supermotif (i.e., the B27 supertype) include at least B*1401 B*1402 
B*1509, B*2702, B*2703, B*2704, B*2705, B*2706, 8*3801. B*3901, B*3902 and 
B*7301. Other allele-specific HLA molecules predicted to be members of the B27 
supertype are shown in Table 4. Peptide binding to each of the allele-specific HLA 
molecules can be modulated by substitutions at primary and/or secondary anchor 
positions, preferably choosing respective residues specified for the supermotif. 
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[00154] HLA-B44 supermotif. The HLA-B44 supermotif is characterized by the presence 
in peptide ligands of negatively charged (D or E) residues as a primary anchor in position 
2, and hydrophobic residues (F, W, Y, L, I, M, V, or A) as a primary anchor at the C- 
terminal position of the epitope (see, e.g., Sidney et al., Immunol. Today 17:261, 1996). 
Exemplary members of the corresponding family of HLA molecules that bind to the B44 
supermotif (i.e., the B44 supertype) include at least: B*1801, B*1802, B*3701, B*4001, 
B*4002, B*4006, B*4402, B*4403, and B*4006. Peptide binding to each of me allele- 
specific HLA molecules can be modulated by substitutions at primary and/or secondary 
anchor positions; preferably choosing respective residues specified for the supermotif. 
[00155] HLA-B58 supermotif. The HLA-B58 supermotif is characterized by the presence 
in peptide ligands of a small aliphatic residue (A, S, or T) as a primary anchor residue at 
position 2, and an aromatic or hydrophobic residue (F, W, Y, L, I, V, M, or A) as a 
primary anchor residue at the C-terminal position of the epitope (see, e.g., Sidney and 
Sette, Immunogenetics, in press, 1999 for reviews of relevant data). Exemplary members 
of the, corresponding family of HLA molecules that bind to the B58 supermotif (i.e., the 
B58 supertype) include at least: B*1516, B*1517, B*5701, B*5702, and B*5801. Other 
allele-specific HLA molecules predicted to be members of the B58 supertype are shown in 
Table 4. Peptide binding to each of the allele-specific HLA molecules can be modulated 
by substitutions at primary and/or secondary anchor positions, preferably choosing 
respective residues specified for the supermotif. 
[00156] HLA-B62 supermotif. The HLA-B62 supermotif is characterized by the presence 
in peptide ligands of the polar aliphatic residue Q or a hydrophobic aliphatic residue (L, V, 
M, L or P) as a primary anchor in position 2, and a hydrophobic residue (F, W, Y, M, L V, 
L, or A) as a primary anchor at the C-terminal position of the epitope (see, e.g., Sidney 
and Sette, Immunogenetics, in press, 1999). Exemplary members of the corresponding 
family of HLA molecules that bind to the B62 supermotif (i.e., the B62 supertype) include 
at least: B*1501, B*1502, B*1513, and B5201. Other allele-specific HLA molecules 
predicted to be members of the B62 supertype are shown in Table 4. Peptide binding to 
each of the allele-specific HLA molecules can be modulated by substitutions at primary 
and/or secondary anchor positions, preferably choosing respective residues specified for 
the supermotif. 

[00157] HLA-A1 motif. The HLA-Al motif is characterized by the presence in peptide 
ligands of T, S, or M as a primary anchor residue at position 2 and the presence of Y as a 
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primary anchor residue at the C-terminal position of the epitope. An alternative allele- 
specific Al motif is characterized by a primary anchor residue at position 3 rather than 
position 2. This motif is characterized by the presence of D, E, A, or S as a primary 
anchor residue in position 3, and a Y as a primary anchor residue at the C-terminal 
position of the epitope (see, e.g., DiBrino et al, J. Immunol., 152:620, 1994; Kondo et al, 
Immunogenetics 45:249, 1997; and Kubo et al., J. Immunol. 152:3913, 1994 for reviews of 
relevant data). Peptide binding to HLA Al can be modulated by substitutions at primary 
and/or secondary anchor positions, preferably choosing respective residues specified for 
the motif. 

[00158] Those epitopes comprising T, S, or M at position 2 and Y at the C-terminal 
position are also HLA-A1 supermotif-bearing peptide epitopes, as these residues are a 
subset of the Al supermotif primary anchors. 

[00159] HLA-A*0201 motif. An HLA-A2*0201 motif was determined to be characterized 
by the presence in peptide ligands of L or M as a primary anchor residue in position 2, and 
L or V as a primary anchor residue at the C-terminal position of a 9-residue peptide {see, 
e.g., Falk et al, Nature 351:290-296, 1991) and was further found to comprise an I at 
position 2 and I or A at 1he C-terminal position of a nine amino acid peptide {see, e.g., 
Hunt et al, Science 255:1261-1263, March 6, 1992; Parker et al, J. Immunol 149:3580- 
3587, 1992). The A*0201 allele-specific motif has also been denned by the present 
inventors to additionally comprise V, A, T, or Q as a primary anchor residue at position 2, 
and M orT as a primary anchor residue at the C-terminal position of the epitope {see, e.g, 
Kast et al, J. Immunol 152:3904-3912, 1994). Thus, the HLA-A*0201 motif comprises 
peptide ligands with L, I, V, M, A, T, or Q as primary anchor residues at position 2 and L, 
I, V, M, A, or T as a primary anchor residue at the C-terminal position of the epitope. The 
preferred and tolerated residues that characterize the primary anchor positions of the HLA- 
A*0201 motif are identical to the residues describing the A2 supermotif. (For reviews of 
relevant data, see, e.g., Del Guercio et al, J. Immunol. 154:685-693, 1995; Ruppert et al, 
Cell 74:929-937, 1993; Sidney et al, Immunol. Today 17:261-266, 1996; Sette and 
Sidney, Curr. Opin. in Immunol 10:478-482, 1998). Secondary anchor residues that 
characterize the A*0201 motif have additionally been defined {see, e.g., Ruppert et al., 
Cell 74:929-937, 1993). These are shown in Table 3. Peptide binding to HLA-A*0201 
molecules can be modulated by substitutions at primary and/or secondary anchor 
positions, preferably choosing respective residues specified for the motif. 
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[00160] HLA-A3 motif. The HLA-A3 motif is characterized by the presence in peptide 
ligands of L, M, V, I, S, A, T, F, C, G, or D as a primary anchor residue at position 2, and 
the presence of K, Y, R, H, F, or A as a primary anchor residue at the C-terminal position 
of the epitope (see, e.g., DiBrino et al, Proc. Natl. Acad. Sci USA 90:1508, 1993; and 
Kubo et al, J. Immunol 152:3913-3924, 1994). Peptide binding to HLA-A3 can be 
modulated by substitutions at primary and/or secondary anchor positions, preferably 
choosing respective residues specified for the motif. 

[00161] The A3 supermotif primary anchor residues comprise a subset of the A3- and Al 1- 
allele specific motif primary anchor residues. 

[00162] HLA-A11 motif. The HLA-A1 1 motif is characterized by the presence in peptide 
ligands of V, T, M, L, I, S, A, G, N, C, D, or F as a primary anchor residue in position 2, 
and K R, Y, or H as a primary anchor residue at the C-terminal position of the epitope 
(see, e.g., Zhang et al, Proc. Natl Acad. Sci USA 90:2217-2221, 1993; and Kubo et al, J. 
Immunol 152:3913-3924, 1994). Peptide binding to HLA-A11 can be modulated by 
substitutions at primary and/or secondary anchor positions, preferably choosing respective 
residues specified for the motif. 

[00163] There is extensive overlap between the A3 and All motif primary anchor 
specificities. 

[00164] HLA-A24 motif. The HLA-A24 motif is characterized by the presence in peptide 
ligands of Y, F, W, or M as a primary anchor residue in position 2, and F, L, I, or W as a 
primary anchor residue at the C-terminal position of the epitope (see, e.g., Kondo et al, J. 
Immunol. 155:4307-4312, 1995; and Kubo et al, J. Immunol. 152:3913-3924, 1994). 
Peptide binding to HLA-A24 molecules can be modulated by substitutions at primary 
and/or secondary anchor positions; preferably choosing respective residues specified for 
the motif. 

[00165] The primary anchor residues characterizing the A24 allele-specific motif comprise 
a subset of the A24 supermotif primary anchor residues. 

[00166] Computer or Manual Screening. Peptides bearing HLA Class I or Class H 
supermotifs or motifs may be identified by computer searches or manually, e.g., as 
follows. In utilizing computer screening to identify peptide epitopes, a protein sequence or 
translated sequence may be analyzed using software developed to search for motifs, for 
example the "FINDP ATTERNS' program (Devereux, et al. Nucl. Acids Res. 12:387-395, 
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1984) or MotifSearch 1.4 software program (D. Brown, San Diego, CA) to identify 
potential peptide sequences containing appropriate HLA binding motifs. The identified 
peptides can be scored using customized polynomial algorithms to predict their capacity to 
bind specific HLA class I or class H alleles. As appreciated by one of ordinary skill in the 
art, a large array of computer programming software and hardware options are available in 
the relevant art which can be employed to implement the motifs in order to evaluate (e.g., 
without limitation, to identify epitopes, identify epitope concentration per peptide length, 
or to generate analogs) known or unknown peptide sequences. 

[00167] Translated antigen protein sequences may be analyzed using a text string search 
software program, e.g., MotifSearch 1.4 (D. Brown, San Diego) to identify potential 
peptide sequences containing appropriate HLA binding motifs; alternative programs are 
readily produced in accordance with information in the art in view of the motif/supermotif 
disclosure herein. Furthermore, such calculations can be made mentally. 

[00168] Identified supermotif or motif sequences may be scored using polynomial 
algorithms to predict their capacity to bind to specific HLA-Class I or Class H molecules. 
These polynomial algorithms take into account both extended and refined motifs (that is, 
to account for the impact of different amino acids at different positions), and are 
essentially based on the premise that the overall affinity (or AG) of peptide-HLA molecule 
interactions can be approximated as a linear polynomial function of the type: 

"AG" = a u x a 2f x a 3i x a„, 

where ap is a coefficient which represents the effect of the presence of a given amino acid 
0) at a given position (i) along the sequence of a peptide of n amino acids. The crucial 
assumption of this method is that the effects at each position are essentially independent of 
each other (i.e., independent binding of individual side-chains). When residuey occurs at 
position i in the peptide, it is assumed to contribute a constant amount j t to the free energy 
of binding of the peptide irrespective of the sequence of the rest of the peptide. This 
assumption is justified by studies from our laboratories that demonstrated that peptides are 
bound to MHC and recognized by T cells in essentially an extended conformation (data 
omitted herein). 

[00169] The method of derivation of specific algorithm coefficients has been described in 
Gulukota et al, J. Mol. Biol. 267:1258-126, 1997; (see also Sidney et al, Human 
Immunol. 45:79-93, 1996; and Southwood et al, J. Immunol. 160:3363-3373, 1998). 
Briefly, for all i positions, anchor and non-anchor alike, the geometric mean of the average 
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relative binding (ARB) of all peptides carrying./ is calculated relative to the remainder of 
the group, and used as the estimate of/,. For Class H peptides, if multiple alignments are 
possible, only the highest scoring alignment is utilized, following an iterative procedure. 
To calculate an algorithm score of a given peptide in a test set, the ARB values 
corresponding to the sequence of the peptide are multiplied. If this product exceeds a 
chosen threshold, the peptide is predicted to bind. Appropriate thresholds are chosen as a 
function of the degree of stringency of prediction desired. 
[00170] Additional methods to identify preferred peptide sequences, which also make use 
of specific motifs, include me use of neural networks and molecular modeling programs 
(see, e.g., Milik et al, Nature Biotechnology 16:753, 1998; Altuvia et al, Hum. Immunol. 
58:1, 1997; Altuvia et al, J. Mol Biol. 249:244, 1995; Buus, S. Curr. Opin. Immunol. 
11:209-213, 1999; Brusic, V. et al, Bioinformatics 14:121-130, 1998; Parker et al., J. 
Immunol. 152:163, 1993; Meister et al, Vaccine 13:581, 1995; Hammer et al, J. Exp. 
Med. 180:2353, 1994; Sturniolo etal, Nature Biotechnol 17:555 1999). 

[00171] Conserved, Semi-conserved, and Non-conserved Non-anchor Residues. The 

determination of non-anchor residues as being conserved (conservative) or semi-conserved 
(semi-conservative) or non-conserved (non-conservative) in comparison to the non-anchor 
poitions of from one to all of the remaining variant(s) is defined by as follows, the results 
of which are summarized in Table 5. 

[00172] Table 5 shows the similarity assignments between any given amino acid pair so 
that a given amino acid substitution could be characterized as being a (conservative) or 
semi-conserved (semi-conservative) or non-conserved (non-conservative) residue. 

[00173] The degree of similarity between amino acid pairs was quantified by averaging, for 
each amino acid pair, the rank coefficient scores for PAM250, hydrophobicity, and side 
chain volume as described below. Based on the average values of these composite 
rankings, Table 5 shows each pair to be conserved, semi-conserved or non-conserved. 

[00174] The Dayhoff PAM250 score (Dayhoff, M.O., et al, Atlas of Protein Sequence and 
Structure, Vol. 5, suppl.3. (1978) M.O. Dayhoff, ed. National Biomedical Research 
Foundation, Washington DC, p. 345; Creighton, T.E., Proteins: structures and molecular 
properties (1993) (2nd edition) W.H. Freeman and Company, NY; 
http://prowl.rockefeUer.edu/aainfo/pam250. html) is a commonly utilized protein 
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alignment scoring matrix which measures the percentage of acceptable point mutations 
(PAM) within a defined time frame. The frequencies of these mutations are different from 
what would be expected from the probability of random mutations, and presumably reflect 
a bias due to the degree of physical and chemical similarity of the amino acid pair 
involved in the substitution. To obtain a score of amino acid similarity that could be 
standardized with other measures of similarity, the PAM250 scores were converted to a 
rank value, where 1 indicates the highest probability of being an accepted mutation. 
[00175] The most commonly utilized scales to represent the relative hydrophobicity of the 
20 naturally occurring amino acids (Cornette, J., etal, J. Mol. Biol. (1987) 195:659) are 
those developed on the basis of experimental data by Kyte and Doolittle (Kyte, J. and R.R 
Doolittle, J. Mol. Biol. (1982) 157:105), and by Fauchere and Pliska (Fauchere, J. and V. 
Pliska, Eur. J. Med. Chem. ( 1983) 18:369). The Kyte/Doolittle scale measures the 
H 2 0/organic solvent partition of individual amino acids. Because it considers the position 
of amino acids in folded proteins, it may most accurately reflect native hydrophobicity in 
the context of proteins. The Fauchere/Pliska scale measures the octanol/H 2 0 partitioning 
of N-acetyl amino acid amides, and most accurately reflects hydrophobicity in the context 
of denatured proteins and/or small synthetic peptides. To obtain scores for 
hydrophobicity, each amino acid residue was ranked on both the Kyte/Doolittle and 
Fauchere/Pliska hydrophobicity scales. An average rank between the two scales was 
calculated and the average difference in hydrophobicity for each pair was calculated. 
[00176] Finally, for calculating amino acid side-chain volume, the partial volume in 
solution obtained by noting the increase in volume of water after adding either one 
molecule or one gram of amino acid residue was considered (Zamyatnin, A. A., Ann. Rev. 
Biophys. Bioeng. (1984) 13:145; Zamyatnin, A.A., Prog. Biophys. Mol. Biol. (1972) 
24:107). The absolute difference in the partial volume of each possible pairing of the 20 
naturally occurring amino acids was calculated and ranked, where 1 indicated residues 
with the most similar volumes, and 20 the most dissimilar. 
[00177] Thus, by consulting Table 5, one can determine whether a residue in a variant is 
considered to be conserved, semi-conserved, or non-conserved in comparison to a residue 
in another variants). The residue of the parent variant (randomly or otherwise chosen 
variant) is shown across the top of Table 5, and the residue of the variants) it is compared 
with is shown below the parent residue. 
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100178] As shown in Table 5, each of the amino acids shown across the top of the table 
bears a numerically defined relationship to the remaining 19 genetically encoded amino 
acids. The lower the index, the higher the conservation; the same amino acid will have a 
similarity assignment of 1.0; maximally different amino acids will have similarity 
assignments approaching 20. Using the method set forth above, amino acids which are not 
gene-encoded can also be assigned similarity indices and can be classified with respect to 
any natively occurring amino acid as conserved (conservative) or semi-conserved (semi- 
conservative) or non-conserved (non-conservative). 

Variant Peptide Epitopes 

[00179] In some embodiments, the invention is directed to an isolated peptide comprising or 

consisting of a variant. In some embodiments, the invention is directed to an isolated 
polynucleotide encoding such a peptide. 
[00180] The isolated variants of the invention are all class I binding peptides, i.e., CTL peptides. 

In particular, the variants of the invention comprise a motif or supermotif, as described above. 
Variants of the invention are those set forth in Tables 6-9 and Figures 1A-4 (SEQ ID Nos:J. 
Variants of the invention may be referred to herein as "variants" and "variant peptide epitopes" or 
referred to by Table or referred to by SEQ ID NO. Other peptide epitopes are referred to herein as 
CTL epitopes or CTL peptides and HTL epitopes or HTL peptides. 
[00181] Peptides and Polynucleotides. Li some embodiments, the invention is directed to an 

isolated peptide comprising or consisting of a variant, wherein the variant consists of a sequence 
selected from those in Tables 6-9 and Figures 1 A-4 (SEQ ID Nos:_J. 
[00182] Peptides of the invention may be fusion proteins of variants) to CTL epitope(s), and/or 

HTL epitope(s), and/or linkers), and/or spacer(s), and/or carriers), and/or additional amino 
acid(s), and/or may comprise or consist of homopolymers of a variant or heteropolymers of more 
than one variant, as is described in detail below. 
[00183] Peptides which comprise a variant of the invention may comprise or consist of a fragment 

of an antigen ("fragment" or "antigenic fragment"), wherein the fragment comprises a variant. The 
fragment may be a portion of any antigen of an infectious agent, e.g., the sequences in Tables 1 1- 
22 (SEQ ID Nos:^ respectively). The variant of the invention may be within the fragment or 
may be linked, directly or indirectly, to the fragment. 
[00184] The fragment may comprise or consist of a region of a native antigen that contains a high 

concentration of class I and/or class H epitopes, preferably it contains the greatest number of 
epitopes per amino acid length. Such epitopes can be present in a frame-shifted manner, e.g. a 10 
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amino acid long peptide could contain two 9 amino acid long epitopes and one 10 amino acid long 
epitope. 

[00185] Hie fragment may be less than or equal to 600 amino acids, less than or equal to 500 

amino acids, less than or equal to 400 amino acids, less than or equal to 250 amino acids, less than 
or equal to 100 amino acids, less than or equal to 85 amino acids, less than or equal to 75 amino 
acids, less than or equal to 65 amino acids, or less than or equal to 50 amino acids in length. In 
certain embodiments, a fragment is less than 101 amino acids in length, in any increment down to 
5 amino acids in length. For example, the fragment may be 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 
43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 

69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 98, 99, or 100 amino acids in length. Fragments of full length antigens may be 
fragments from about residue 1-20,21-40,41-60, 61-80, 81-100, 101-120, 121-140, 141-160, 161- 
180, 181-200, 201-220, 221-240, 241-260, 261-280, 281-300, 301-320, 321-340, 341-360, 361- 
380, 381-400, 401-420, 421-440, 441-460, 461-480, 481-500, 501-520, 521-540, 541-560, 561- 
580, 581-600, 601-620, 621-680, 681-700, 701-720, 721-740, 741-780, 781-800, §01-820, 821- 
840, 841-860, 861-880, 881-900, 901-920, 921-940, 941-960, 961-980, 981 to the C-terminus of 
the antigen. 

[00186] Peptides which comprise a variant of the invention may be a fusion protein comprising one 

or more amino acid residues in addition to the variant or fragment. Fusion proteins include 
homopolymers and heteropolymers, as described below. 

[00187] In some embodiments, the peptide comprises or consists of multiple variants, e.g., 2, 3, 4, 

5, 6, 7, 8, or 9 variants of the invention. In some embodiments, the peptide comprises at least 1, at 
least 2, at least 3, at least 4, at least 5, at least 6, at least 7, or at least 8 variants of the invention. 

[00188] The peptide may also be a homopolymer of one variant or the peptide may be a 

heteropolymer which contains at least two different variants. Polymers have the advantage of 
increased probability for immunological reaction and, where different variants are used to make up 
the polymer, the ability to induce antibodies and/or T cells that react with different antigenic 
determinants of the antigen(s) targeted for an immune response. 

[00189] A homopolymer may comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 

20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 45, 50, 55, 60, 65, 

70, 75, 80, 85, 90, 95, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, or 150 copies of the same 
variant. 

[00190] A heteropolymer may comprise one or more copies of an individual variant and one or 

more copies of one or more different variants of the invention. The variants that form a 
heteropolymer may all be from the same antigen, e.g., may be from any of those in Tables 1 1-22 
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(SEQ ID NOS:_) or other antigens herein or known in the art, or may be from different antigens, 
preferably from infectious agents. Combinations of variants that may form a heteropolymer 
include, for example, Gag 545 variants EPLTSLKSLF (SEQ ID NO: J and YPLASLKSLF (SEQ 
ID NO:_), or combinations of peptides from different tables in Tables 6-9 and/or Figures 1 A-4 or 
those combinations in Tables 23-28. Heteropolymers may contain multiple copies of one or more 
variants. 

[00191] Thus, peptides of the invention such as heteropolymers may comprise a first variant and at 

least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 
29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 other 
(different) variants. 

[00192] Li some embodiments, the peptide comprising a variant may also comprise a number of 

CTL and/or HTL epitopes, e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 
22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47,' 
48, 49, or 50 CTL and/or HTL epitopes. 

[00193] The CTL and/or HTL epitope and the variant of the invention may be from the same 

antigen of an infectious agent or from different antigens. Thus, for example, if the variant is from 
HIV pol, the CTL peptide and/or HTL peptide may also be from HIV pol. Alternatively, if the 
variant is from HIV pol, the CTL peptide and/or HTL peptide may be from another antigen such as 
HIV env or HIV vpr. As another example, if the variant is from HBV E6, the CTL peptide and/or 
HTL peptide may be from HBV E7. The CTL and/or HTL epitope and the variant of the invention 
may be from the same infectious agent or different infectious agents. Thus, for example, the 
variant may be from HTV, and the CTL and/or HTL epitope may be from HIV or may be from 
another infectious agent sush such as HBV, HCV, HPV, or Plasmodium falciparum. 
[00194] The CTL peptide and/or HTL peptide may be from other antigens including hepatitis B 

core and surface antigens (HBVc, HBVs), hepatitis C antigens, Epstein-Barr virus antigens, human 
immunodeficiency virus (HTV) antigens and human papilloma virus (HPV) antigens (in particular 
anitgens from HPV-16, HPV-18, HPV-31, HPV-33, HPV-45, HPV-52, HPV-56 and HPV-58, 
Mycobacterium tuberculosis and Chlamydia. Examples of suitable fungal antigens include those 
derived from Candida albican, Cryptococcus neoformans, Coccidoides spp., Histoplasma spp, 
and Aspergillus fumigatis. Examples of suitable protozoan parasitic antigens include those derived 
fromPlasmodium spp., including P. falciparum, Trypanosoma spp., Schistosoma spp., Leishmania 
spp and the like. 

[00195] Alternatively, the CTL peptide and/or HTL peptide may be from tumor-associated 

antigens such as but not limited to, melanoma antigens MAGE-1, MAGE-2, MAGE-3, MAGE-11, 
MAGE-A10, as well as BAGE, GAGE, RAGE, MAGE-C1, LAGE-1, CAG-3, DAM, MUC1,' 
MUC2, MUC18, NY-ESO-1, MUM-1, CDK4, BRCA2, NY-LU-1, NY-LU-7, NY-LU-12,' 
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CASP8, RAS, KIAA-2-5, SCCs, p53, p73, CEA, HER2/neu, Melan-A, gplOO, tyrosinase, TRP2, 
gp75/TRPl, kallifcrein, prostate-specific membrane antigen (PSM), prostatic acid phosphatase 
(PAP), prostate-specific antigen (PSA), PT1-1, 3-catenin, PRAME, Telomerase, FAK, cyclin Dl 
protein, NOEY2, EGF-R, SART-1, CAPB, HPVE7, pl5, Folate receptor CDC27, PAGE-1, and 
PAGE-4. 

[00196] Examples of CTL peptides and HTL peptides are disclosed in WO 01/42270, 
published 14 June 2001; WO 01/41788, published 14 June 2001; WO 01/42270, published 
14 June 2001; WO 01/45728, published 28 June 2001; and WO 01/41787, published 14 
June 2001. 

[00197] The HTL peptide may comprise a "loosely HLA-restricted" or "promiscuous" 
sequence. Examples of amino acid sequences that are promiscuous include sequences 
from antigens such as tetanus toxoid at positions 830-843 (QYJXANSKFIGITE; SEQ ID 
NO: 627), Plasmodium falciparum CS protein at positions 378-398 
(DIEKKIAKMEKASSVFNVVNS; SEQ ID NO: 628), and Streptococcus 18kD protein at 
positions 116-131 (GAVDSJXGGVATYGAA; SEQ ID NO: 629). Other examples 
include peptides bearing a DR. 1-4-7 supermotif, or either of the DR3 motifs. 

[00198] The HTL peptide may comprise a synthetic peptide such as a Pan-DR-binding 
epitope {e.g., a PADRE® peptide, Epimmune Inc., San Diego, CA, described, for example, 
in U.S. Patent Number 5,736,142), for example, having the formula aKXVAAZTLKAAa, 
where "X" is either cyclohexylalanine, phenylalanine, or tyrosine; "Z" is either 
tryptophan, tyrosine, histidine or asparagine; and "a" is either D-alanine or L-alanine (SEQ 
ID NO: 746). Certain pan-DR binding epitopes comprise all "L" natural amino acids; 
these molecules can be provided as peptides or in the form of nucleic acids that encode the 
peptide. See also, U.S. Patent Nos. 5,679,640 and 6,413,935. 

[00199] The peptide comprising a variant may comprise additional amino acid(s). Such 
additional amino acids may be Ala, Arg, Asn, Asp, Cys, Gin, Gly, Glu, His, He, Leu, Lys, 
Met, Phe, Pro, Ser, Thr, Tyr, Tip, Val, amino acid mimetics, and other unnatural amino 
acids such as those described below. Additional amino acids may provide for ease of 
linking peptides one to another, for linking variants to one another, for linking variants to 
CTL and/or HTL epitopes, for coupling to a carrier support or larger peptide, for 
modifying the physical or chemical properties of the peptide or oligopeptide, or the like. 
Amino acids such as Ala, Arg, Asn, Asp, Cys, Gin, Gly, Glu, His, lie, Leu, Lys, Met, Phe, 
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Pro, Ser, Thr, Tyr, Trp, or Val, or the like, can be introduced at the C- and/or N-terminus 
of the peptide and/or can be introduced internally. 
[00200] The peptide comprising a variant may comprise an amino acid spacer(s), which 
may be joined to the variants, CTL epitopes, HTL epitopes, carriers, etc. within a peptide 
or may be joined to the peptide at the N-and/or C-terminus. Thus, spacers may be at the N- 
terminus or C-terminus of peptide, or may be internal such that they link or join variants, 
CTL epitopes, HTL epitopes, carriers, additional amino acids, and/or antigenic fragments' 
one to the other. 

[00201] The spacer is typically comprised of one or more relatively small, neutral 
molecules, such as amino acids or amino acid mimetics, which are substantially uncharged 
under physiological conditions. The spacers are typically selected from, e.g., Ala, Gly, or 
other neutral spacers of nonpolar amino acids or neutral polar amino acids. It will' be 
understood that the optionally present spacer may be composed of the same residues or 
may be composed of one or more different residues and thus may be a homo- or hetero- 
oligomer of spacer residues. Thus, the spacer may contain more than one Ala residue 
(poly-alanine) or more than one Gly residue (poly-glycine), or may contain both Ala and 
Gly residues, e.g., G ly, Gly-Gly-, Ser,Ser-Ser-, 

Gly-Ser-, Ser-Gly-, etc. When present, the spacer will usually be at least one or two 
residues, more usually three to six residues and sometimes 10 or more residues, e.g., 3, 4, 
5, 6, 7, 8, 9, or 10, or even more residues. (Livingston, B.D. et al. Vaccine 19 4652-4660 
(2000)). 

[00202] Peptides comprising a variant may comprise carriers) such as those well known in 
the art, e.g., thyroglobulin, albumins such as human serum albumin, tetanus toxoid, 
polyamino acids such as poly L-lysine, poly L-glutamic acid, influenza virus proteins,' 
hepatitis B virus core protein, and the like. (See Table 29). 

[00203] In addition, the peptide comprising or consisting of a variant may be modified by 
terminal-NH 2 acylation, e.g., by alkanoyl ( Cl -C 2 o) or thioglycolyl acetylation, terminal- 
carboxyl amidation, e.g., ammonia, memylamine, etc. In some instances these 
modifications may provide sites for linking to a support or other molecule. 
[00204] The peptides in accordance with the invention can contain modifications such as 
but not limited to glycosylate, side chain oxidation, biotinylation, phosphorylation, 
addition of a surface active material, e.g. a lipid, or can be chemically modified, e.g., 
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acetylation, etc. Moreover, bonds in the peptide can be other than peptide bonds, e.g., 
covalent bonds, ester or ether bonds, disulfide bonds, hydrogen bonds, ionic bonds, etc. 

[00205] Peptides of the present invention may contain substitutions to modify a physical 
property (e.g., stability or solubility) of the resulting peptide. For example, peptides may 
be modified by the substitution of a cysteine (C) with a-amino butyric acid ("B"). Due to 
its chemical nature, cysteine has the propensity to form disulfide bridges and sufficiently 
alter the peptide structurally so as to reduce binding capacity. Substituting a-amino 
butyric acid for C not only alleviates this problem, but actually improves binding and 
crossbinding capability in certain instances. Substitution of cysteine with a-amino butyric 
acid may occur at any residue of a peptide, e.g., at either anchor or non-anchor positions of 
a variant within a peptide, or at other positions of a peptide. 

[00206] The peptides comprising a variant can comprise amino acid mimetics or unnatural 
amino acids, e.g. D- or L-naphylalanine; D- or L-phenylglycine; D- or L-2-thieneylalanine; 
D- or L-l, -2, 3, or 4-pyreneylalanine; D- or L-3 thieneylalanine; D- or L-(2-pyridinyl)- 
alanine; D- or L-(3-pyridinyl)-alanine; D- or L-(2-pyrazinyl)-alanine; D- or L-(4-isopropyl> 
phenylglycine; D-(trifluoromethyl)-phenylglycine; D-(trifluoromethyl)-phenylalanine; D-p- 
fluorophenylalanine; D- or L-p-biphenylphenylalanine; D- or L-p- 
methoxybiphenylphenylalanine; D- or L-2-indole(alkyl)alanines; and, D- or L- 
alkylalanines, where the alkyl group can be a substituted or unsubstituted methyl, ethyl, 
propyl, hexyl, butyl, pentyl, isopropyl, iso-butyl, sec-isotyl, iso-pentyl, or a non-acidic 
amino acids. Aromatic rings of a non-natural amino acid include, e.g., thiazolyl, 
thiophenyl, pyrazolyl, benzimidazolyl, naphthyl, furanyl, pyrrolyl, and pyridyl aromatic 
rings. Modified peptides that have various amino acid mimetics or unnatural amino acids 
are particularly useful, as they tend to manifest increased stability in vivo. Such peptides 
may also possess improved shelf-life or manufacturing properties. 

[00207] Peptide stability can be assayed in a number of ways. For instance, peptidases and 
various biological media, such as human plasma and serum, have been used to test 
stability. See, e.g., Verhoef, et ah, Eur. J. Drug Metab. Pharmacokinetics 11:291 (1986). 
Half-life of the peptides of the present invention is conveniently determined using a 25% 
human serum (v/v) assay. The protocol is generally as follows: Pooled human serum 
(Type AB, non-heat inactivated) is delipidated by centrifugation before use. The serum is 
then diluted to 25% with RPMI-1640 or another suitable tissue culture medium. At 
predetermined time intervals, a small amount of reaction solution is removed and added to 
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either 6% aqueous trichloroacetic acid (TCA) or ethanol. The cloudy reaction sample is 
cooled (4°C) for 15 minutes and then spun to pellet the precipitated serum proteins. The 
presence of the peptides is then determined by reversed-phase HPLC using stability- 
specific chromatography conditions. 
[00208] As indicated above, the peptides in accordance with the invention can be a variety 
of lengths, and either in their neutral (uncharged) forms or in forms which are salts. The 
peptides in accordance with the invention can contain modifications such as glycosylation, 
side chain oxidation, or phosphorylation, generally subject to the condition that 
modifications do not destroy the biological activity of the peptides. 
[00209] The peptides of the invention may be lyophylized, or may be in crystal form. 
[00210] It is generally preferable that the variant peptide epitope be as small as possible 
while still mamtaining substantially all of the immunologic activity of the native protein. 
When possible, it may be desirable to optimize HLA class I binding epitopes of the 
invention to a length of about 8 to about 13 amino acid residues, for example, 8, 9, 10, 11, 
12 or 13, preferably 8 to 1 1 or 9 to 10. It is to be appreciated that one or more epitopes iii 
this size range can be comprised by a longer peptide (see the Definition Section for the 
term "epitope" for further discussion of peptide length). HLA class II binding epitopes are 
preferably optimized to a length of about 6 to about 30 amino acids in length, e.g., 6, 7, 8, 
9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29 or 30, 
preferablyto between about 13 and about 20 residues, e.g., 13, 14, 15, 16, 17^ 18, 19 or 20. 
Preferably, the epitopes are commensurate in size with endogenously processed pathogen- 
derived peptides or tumor cell peptides that are bound to the relevant HLA molecules. The 
identification and preparation of peptides of various lengths can be carried out using the 
techniques described herein. 
[00211] Peptides in accordance with the invention can be prepared synthetically, by 
recombinant DNA technology or chemical synthesis, or can be isolated from natural 
sources such as native tumors or pathogenic organisms. Epitopes may be synthesized 
individually or joined directly or indirectly in a peptide. Although the peptide will 
preferably be substantially free of other naturally occurring host cell proteins and 
fragments thereof, in some embodiments the peptides may be synthetically conjugated to 
be joined to native fragments or particles. 
[00212] The peptides of the invention can be prepared in a wide variety of ways. For 
relatively short sizes, the peptides can be synthesized in solution or on a solid support in 
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accordance with conventional techniques. Various automatic synthesizers are 
commercially available and can be used in accordance with known protocols. (See, for 
example, Stewart & Young, Solid Phase Peptide Synthesis, 2d. ed., Pierce Chemical 
Co., 1984). Further, individual peptides can be joined using chemical ligation to produce 
larger peptides that are still within the bounds of the invention. 
[00213] Alternatively, recombinant DNA technology can be employed wherein a 
nucleotide sequence which encodes a peptide inserted into an expression vector, 
transformed or transfected into an appropriate host cell and cultivated under conditions 
suitable for expression. These procedures are generally known in the art, as described 
generally in Sambrook et ah, Molecular Cloning, a Laboratory Manual, Cold 
Spring Harbor Press, Cold Spring Harbor, New York (1989). Thus, recombinant peptides, 
which comprise or consist of one or more epitopes of the invention, can be used to present 
the appropriate T cell epitope. 

[00214] Polynucleotides encoding each of the peptides above are also part of the invention. 
As appreciated by one of ordinary skill in the art, various nucleic acids will encode the 
same peptide due to the redundancy of the genetic code. Each of these nucleic acids falls 
within the scope of the present invention. This embodiment of the invention comprises 
DNA and RNA and in certain embodiments a combination of DNA and RNA. It is to be 
appreciated that any polynucleotide that encodes a peptide in accordance with the 
invention falls within the scope of this invention. 
[00215] The polynucleotides encoding peptides contemplated herein can be synthesized by 
chemical techniques, for example, the phosphotriester method of Matteucci, et al, J. Am. 
Chem. Soc. 103:3185 (1981). Polynucleotides encoding peptides comprising or consisting 
of a variant can be made simply by substituting the appropriate and desired nucleic acid 
base(s) for those that encode a related (e.g., analogous) epitope. 
[00216] The polynucleotide, e.g. minigene (see below), may be produced by assembling 
oligonucleotides that encode the plus and minus strands of the polynucleotide, e.g. 
minigene. Overlapping oligonucleotides (15-100 bases long) may be synthesized, 
phosphorylated, purified and annealed under appropriate conditions using well known 
techniques. The ends of the oligonucleotides can be joined, for example, using T4 DNA 
ligase. A polynucleotide, e.g. minigene, encoding the peptide of the invention, can be 
cloned into a desired vector such as an expression vector. The coding sequence can then 
be provided with appropriate linkers and ligated into expression vectors commonly 
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available in the art, and the vectors used to transform suitable hosts to produce the desired 
peptide such as a fusion protein. 
(002171 A large number of such vectors and suitable host systems are known to those of 
skill in the art, and are commercially available. The following vectors are provided by 
way of example. Bacterial: pQE70, pQE60, pQE-9 (Qiagen), pBS, pDIO, phagescript 
psiX174, pBluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene)- 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRTT5 (Pharmacia); p C R (Mvitrogen)' 
Eukaryotic: pWIJMEO, P SV2CAT, pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV 
pMSG, pSVL (Pharmacia); P 75.6 (valentis); pCEP (Invitrogen); pCEI (Epimmune)' 
However, any other plasmid or vector can be used as long as it is replicable and viable in 
the host. 

[00218J As representative examples of appropriate hosts, there can be mentioned: bacterial 
cells, such as E. coli, Bacillus subtilis, Salmonella typhimurium and various species within 
the genera Pseudomonas, Streptomyces, and Staphylococcus; fungal cells, such as yeasf 
insect cells such as Drosophila and Sf9; animal cells such as COS-7 lines of monkey 
kidney fibroblasts, described by Gluzman, Cell 23:175 (1981), and other cell lines capable 
of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell 
lines or Bowes melanoma; plant cells, etc. The selection of an appropriate host is deemed 
to be within the scope of those skilled in the art from the teachings herein. 

[00219] Thus, the present invention is also directed to vectors, preferably expression 
vectors useful for the production of the peptides of the present invention, and to host cells 
comprising such vectors. 

[00220] Host cells are genetically engineered (transduced or transformed or transfected) 
wxth the vectors of this invention which can be, for example, a cloning vector or an 
expression vector. The vector can be, for example, in the form of a plasmid, a viral 
parade, a phage, etc. The engineered host cells can be cultured in conventional nutrient 
media modified as appropriate for activating promoters, selecting transformants or 
amphfying the polynucletides. The culture conditions, such as temperature, pH and the 
like, are those previously used with the host cell selected for expression, and will be 
apparent to the ordinarily skilled artisan. 

[00221] For expression of the peptides, the coding sequence will be provided with operably 
linked start and stop codons, promoter and terminator regions and usually a replication 
system to provide an expression vector for expression in the desired cellular host. For 
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example, promoter sequences compatible with bacterial hosts are provided in plasmids 
containing convenient restriction sites for insertion of the desired coding sequence. The 
resulting expression vectors are transformed into suitable bacterial hosts. 

[00222] Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 
expressed gene to direct transcription of a downstream structural sequence. Such 
promoters can be derived from operons encoding glycolytic enzymes such as 
3-phosphoglycerate kinase (PGK), V-factor, acid phosphatase, or heat shock proteins, 
among others. The heterologous structural sequence is assembled in appropriate phase 
with translation initiation and termination sequences, and preferably, a leader sequence 
capable of directing secretion of translated protein into the periplasmic space or 
extracellular medium. Optionally, the heterologous sequence can encode a fusion protein 
including an N-terminal identification peptide imparting desired characteristics, e.g., 
stabilization or simplified purification of expressed recombinant product. 

[00223] Yeast, insect or mammalian cell hosts may also be used, employing suitable 
vectors and control sequences. Examples of mammalian expression systems include the 
COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981), 
and other cell lines capable of expressing a compatible vector, for example, the C127, 
3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors will comprise an 
origin of replication, a suitable promoter and enhancer, and also any necessary ribosome 
binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. Such promoters may 
also be derived from viral sources, such as, e.g., human cytomegalovirus (CMV-IE 
promoter) or herpes simplex virus type-1 (HSV TK promoter). Nucleic acid sequences 
derived from the SV40 splice, and polyadenylation sites can be used to provide the 
required nontranscribed genetic elements. 

[00224] Polynucleotides encoding peptides of the invention may also comprise a 
ubiquitination signal sequence, and/or a targeting sequence such as an endoplasmic 
reticulum (ER) signal sequence to facilitate movement of the resulting peptide into the 
endoplasmic reticulum. 

[00225] Polynucleotides of the invention, e.g., minigenes, may be expressed in human 
cells. A human codon usage table can be used to guide the codon choice for each amino 
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acid. Such polynucleotides preferably comprise spacer amino acid residues between 
variants, such as those described above, or may comprise naturally-occurring flanking 
sequences adjacent to the variants (and/or CTL and HTL epitopes). 
[00226] The peptides of the invention can also be expressed by viral or bacterial vectors. 
Examples of expression -vectors include attenuated viral hosts, such as vaccinia or 
fowlpox. As an example of this approach, vaccinia virus is used as a vector to express 
nucleotide sequences that encode the peptides of the invention. Vaccinia vectors and 
methods useful in immunization protocols are described in, e.g., U.S. Patent No. 
4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are described 
in Stover et al, Nature 351:456-460 (1991). A wide variety of other vectors useful for 
therapeutic administration or immunization of the polypeptides of the invention, e.g. 
adeno and adeno-associated virus vectors, retroviral vectors, Salmonella typhi vectors, 
detoxified anthrax toxin vectors, and the like, will be apparent to those skilled in the art 
from the description herein. A preferred vector is Modified Vaccinia Ankara (MVA) (e.g., 
Bavarian Noridic (MVA-BN)). 

[00227] Standard regulatory sequences well known to those of skill in the art are preferably 
included in the vector to ensure expression in the human target cells. Several vector 
elements are desirable: a promoter with a downstream cloning site for polynucleotide, e.g., 
minigene insertion; a polyadenylation signal for efficient transcription termination; an E. 
coli origin of replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin 
resistance). Numerous promoters can be used for this purpose, e.g., the human 
cytomegalovirus (hCMV) promoter. See, e.g., U.S. Patent Nos. 5,580,859 and 5,589,466 
for other suitable promoter sequences. A preferred promoter is the CMV-IE promoter. 

[00228] Polynucleotides, e.g. minigenes, may comprise one or more synthetic or naturally- 
occurring introns in the transcribed region. The inclusion of mRNA stabilization 
sequences and sequences for replication in mammalian cells may also be considered for 
increasing polynucleotide, e.g. minigene, expression. 

[00229] In addition, the polynucleotide, e.g. minigene, may comprise immunostimulatory 
sequences (ISSs or CpGs); These sequences may be included in the vector, outside the 
polynucleotide (e.g. minigene) coding sequence to enhance immunogenicity. 

[00230] In some embodiments, a bi-cistronic expression vector which allows production of 
both the polynucleotide- (e.g. minigene-) encoded peptides of the invention and a second 
protein (e.g., one that modulates immunogenicity) can be used. Examples of proteins or 
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polypeptides that, if co-expressed with peptides of the invention, can enhance an immune 
response include cytokines (e.g., JL-2, IL-12, GM-CSF), cytokme-inducing molecules 
(e.g., LeIF), costimulatory molecules, or pan-DR binding proteins (PADRE® molecules, 
Epimmune, San Diego, CA). Helper T cell (HTL) epitopes such as PADRE® molecules 
can be joined to intracellular targeting signals and expressed separately from expressed 
peptides of the invention. Specifically decreasing the immune response by co-expression 
of immunosuppressive molecules (e.g. TGF-P) may be beneficial in certain diseases. 
[00231] Once an expression vector is selected, the polynucleotide, e.g. minigene, is cloned 
into the polylinker region downstream of the promoter. This plasmid is transformed into 
an appropriate bacterial strain, and DNA is prepared using standard techniques. The 
orientation and DNA sequence of the polynucleotide, e.g. minigene, as well as all other 
elements included in the vector, are confirmed using restriction mapping, DNA sequence 
analysis, and/or PCR analysis. Bacterial cells harboring the correct plasmid can be stored 
as cell banks. 

[00232] Therapeutic/prophylactic quantities of DNA can be produced for example, by 
fermentation in E. coli, followed by purification. Aliquots from the working cell bank are 
used to inoculate growth medium, and are grown to saturation in shaker flasks or a 
bioreactor according to well known techniques. Plasmid DNA is purified using standard 
bioseparation technologies such as solid phase anion-exchange resins available, e.g., from 
QIAGEN, Inc. (Valencia, California). If required, supercoiled DNA can be isolated from 
the open circular and linear forms using gel electrophoresis or other methods. 
[00233] Purified polynucleotides, e.g. minigenes, can be prepared for injection using a 
variety of formulations. The simplest of these is reconstitution of lyophilized 
polynucleotide, e.g. DNA, in sterile phosphate-buffer saline (PBS). This approach, known 
as "naked DNA," is currently being used for intramuscular (IM) administration in clinical 
trials. To maximize the immunotherapeutic effects of polynucleotide vaccines, alternative 
methods of formulating purified plasmid DNA may be used. A variety of such methods 
have been described, and new techniques may become available. Cationic lipids, 
glycolipids, and fusogenic liposomes can also be used in the formulation (see, e.g., WO 
93/24640; Mannino & Gould-Fogerite, BioTechniques 6(7): 682 (1988); U.S. Patent No. 
5,279,833; WO 91/06309; and Feigner, et al, Proc. Nat 'I Acad. Sci. USA 84:7413 (1987). 
In addition, peptides and compounds referred to collectively as protective, interactive, 
non-condensing compounds (PINC) can also be complexed to purified plasmid DNA to 
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influence variables such as stability, intramuscular dispersion, or trafficking to specific 
organs or cell types. 

[00234] Known methods in the art can be used to enhance delivery and uptake of a 
polynucleotide in vivo. For example, the polynucleotide can be complexed to 
polyvinylpyrrolidone (PVP), to prolong the localized bioavailability of the polynucleotide, 
thereby enhancing uptake of the polynucleotide by the organisum (see e.g., U.S. Patent 
No. 6,040,295; EP 0 465 529; WO 98/17814). PVP is a polyamide that is known to form 
complexes with a wide variety of substances, and is chemically and physiologically inert. 

[00235] Target cell sensitization can be used as a functional assay of the expression and 
HLA class I presentation of polynucleotide- (e.g. niinigene-) encoded peptides. For 
example, the polynucleotide, e.g. plasmid DNA, is introduced into a mammalian cell line 
that is a suitable target for standard CTL chromium release assays. The transfection 
method used will be dependent on the final formulation. For example, electroporation can 
be used for "naked" DNA, whereas cationic lipids or PVP-formulated DNA allow direct in 
vitro transfection. A plasmid expressing green fluorescent protein (GFP) can be co- 
transfected to allow enrichment of transfected cells using fluorescence activated cell 
sorting (FACS). The transfected cells are then chromium-51 ( 51 Cr) labeled and used as 
targets for epitope-specific CTLs. Cytolysis of the target cells, detected by 51 Cr release, 
indicates both production and HLA presentation of, polynucleotide-, e.g. minigene-, 
encoded variants of the invention, or peptides comprising them. Expression of HTL 
epitopes may be evaluated in an analogous manner using assays to assess HTL activity. 

[00236] In vivo immunogenicity is a second approach for functional testing of 
polynucleotides, e.g. minigenes. Transgenic mice expressing appropriate human HLA 
proteins are immunized with the polynucleotide, e.g. DNA, product. The dose and route 
of administration are formulation dependent (e.g., IM for polynucleotide (e.g., naked DNA 
or PVP-formulated DNA) in PBS, intraperitoneal (IP) for lipid-complexed polynucleotide 
(e.g., DNA)). Eleven to twenty-one days after immunization, splenocytes are harvested 
and restimulated for one week in the presence of polynucleotides encoding each peptide 
being tested. Thereafter, for peptides comprising or consisting of variants, standard assays 
are conducted to determine if there is cytolysis of peptide-loaded, 5, Cr-labeled target cells. 
Once again, lysis of target cells that were exposed to variants corresponding to those 
encoded by the polynucleotide (e.g. minigene) demonstrates polynucleotide (e.g., DNA) 
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vaccine function and induction of CTLs. Immunogenicity of HTL epitopes is evaluated in 
transgenic mice in an analogous manner. 
[00237] Alternatively, the nucleic acids can be administered using ballistic delivery as 
described, for instance, in U.S. Patent No. 5,204,253. Using this technique, particles 
comprised solely of a polynucleotide such as DNA are administered. In a further 
alternative embodiment for ballistic delivery, polynucleotides such as DNA can be 
adhered to particles, such as gold particles. 
[00238] The use of polynucleotides such as multi-epitope minigenes is described herein and 
in, e.g. co-pending application U.S.S.N. 09/31 1,784; Ishioka et al., J. Immunol. 162:3915- 
3925, 1999; An, L. and Whitton, J. L., J. Virol 71:2292, 1997; Thomson, S. A. et al, J. 
Immunol. 157:822, 1996; Whitton, J. L. et al., J. Virol. 67:348, 1993; Hanke, R. et al., 
Vaccine 16:426, 1998. For example, a polynucleotide such as a multi-epitope DNA 
plasmid can be engineered which encodes an epitope derived from multiple regions of a 
infectious agent (e.g., p53, HER2/nev, MAGE-2/3, or CEA), a pan-DR binding peptide 
such as the PADRE® universal helper T cell epitope, and an endoplasmic reticulum- 
translocating signal sequence. As descibed in the sections above, a peptide/polynucleotide 
may also comprise/encode epitopes that are derived from other infectious agents. 
[00239] Thus, the invention includes peptides as described herein, polynucleotides 
encoding each of said peptides, as well as compositions comprising the peptides and 
polynucleotides, and includes methods for producing and methods of using the peptides, 
polynucleotides, and compositions, as further described below. 
[00240] Compositions. In other embodiments, the invention is directed to a composition 
comprising one or more peptides and/or polynucleotides of the invention and optionally 
another components). 

[00241] In some embodiments, the composition comprises or consists of multiple peptides, 
e.g., 2, 3, 4, 5, 6, 7, 8, or 9 peptides of the invention. In some embodiments, the 
composition comprises at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, or at 
least 8 peptides of the invention. Combinations of peptides include, for example, a peptide 
comprising or alternatively consisting of the Gag 545 variant EPLTSLKSLF (SEQ ID 
NO: J and a peptide comprising or alternatively consisting of the Gag 545 variant 
YPLASLKSLF (SEQ ID NO:_), or combinations of peptides from different tables in 
Tables 6-9 and/or Figures 1 A-4. 
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[00242] Compositions of the invention may comprise polynucleotides encoding the above 

peptides and/or combinations of peptides. 

[00243] The composition can comprise at least 2, at least 3, at least 4, at least 5, at least 6, 

at least 7, or at least 8 peptides and/or polynucleotides selected from those described above 
or below. At least one of the one or more peptides can be a heteropolymer or a 
homopolymer. Additionally, the composition can comprise a CTL and/or HTL epitope, 
which can be derived from a tumor-associated antigen. The additional epitope can also be 
a PanDR binding molecule, {e.g., a PADRE® universal helper T cell epitope). 

[00244] Optional components include excipients, diluents, proteins such as peptides 

comprising a CTL epitope, and/or an HTL epitope such as a pan-DR binding peptide {e.g., 
a PADRE® universal helper T cell epitope), and/or a carrier, polynucleotides encoding 
such proteins, lipids, or liposomes, as well as other components described herein. There 
are numerous embodiments of compositions in accordance with the invention, such as a 
cocktail of one or more peptides and/or polynucleotides {e.g., minigenes); a cocktail of 
one or more peptides and/or polynucleotides {e.g., minigenes) and one or more CTL 
and/or HTL epitopes. 

[00245] Compositions may comprise one or more peptides (and/or polynucleotides such as 

minigenes) of the invention, along with one or more other components as described above 
and herein. "One or more*' refers to any whole unit integer from 1-150, e.g., at least 2, 3, 
4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 
30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 , 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 
105, 110, 115, 120, 125, 130, 135, 140, 145, or 150 peptides, polynucleotides, or other 
components. 

[00246] Compositions of the invention may be, for example, polynucleotides or 

polypeptides of the invention combined with or complexed to cationic lipid formulations; 
lipopeptides (e.g.,Vitiello, A. et aL, J. Clin. Invest. 95:341, 1995), encapsulated e.g., in 
poly(DL-lactide-co-glycolide) ("PLG") microspheres {see, e.g., Eldridge, et aL, Molec. 
Immunol. 28:287-294, 1991: Alonso et aL, Vaccine 12:299-306, 1994; Jones et aL, 
Vaccine 13:675-681, 1995); peptide compositions contained in immune stimulating 
complexes (ISCOMS) {see, e.g., Takahashi et aL, Nature 344:873-875, 1990; Hu et aL, 
Clin Exp Immunol. 113:235-243, 1998); multiple antigen peptide systems (MAPs) {see 
e.g., Tam, J. P., Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413, 1988; Tarn, J.P., J. Immunol. 
Methods 196:17-32, 1996); viral, bacterial, or, fungal delivery vectors (Perkus, M. E. et 
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al, In: Concepts in vaccine development, Kaufinann, S. H. E., ed., p. 379, 1996; 
Chakrabarti, S. et al, Nature 320:535, 1986; Hu, S. L. et al., Nature 320:537, 1986; 
Kieny, M.-P. et al., AIDS Bio/Technology 4:790, 1986; Top, F. H. et al, J. Infect. Dis. 
124:148, 1971; Cfaanda, P. K. et al., Virology 175:535, 1990); particles of viral or 
synthetic origin [e.g., Kofler, N. et al, J. Immunol. Methods. 192:25, 1996; Eldridge, J. H. 
et al, Sem. Hematol. 30:16, 1993; Falo, L. D., Jr. et al, Nature Med. 7:649, 1995); 
adjuvants (e.g., incomplete Freund's adjuvant) (Warren, H. S., Vogel, F. R, and Chedid, 
L. A. Annu. Rev. Immunol. 4:369, 1986; Gupta, R. K. et al, Vaccine 11:293, 1993); 
liposomes (Reddy, R. et al, J. Immunol 148:1585, 1992; Rock, K. L., Immunol Today 
17:131, 1996); or, particle-absorbed cDNA or other polynucleotides of the invention 
(Ulmer, J. B. et al, Science 259:1745, 1993; Robinson, H. L., Hunt, L. A., and Webster, 
R. G., Vaccine 11:957, 1993; Shiver, J. W. et al, In: Concepts in vaccine development, 
Kaufinann, S. H. E., ed., p. 423, 1996; Cease, K. B., and Berzofcky, J. A., Annu. Rev. 
Immunol. 12:923, 1994 and Eldridge, J. H. et al, Sem. Hematol 30:16, 1993), etc. Toxin- 
targeted delivery technologies, also known as receptor mediated targeting, such as those of 
Avant hnmunotherapeutics, Inc. (Needham, Massachusetts) or attached to a stress protein, 
e.g., HSP 96 (Stressgen Biotechnologies Corp., Victoria, BC, Canada) can also be used. 
[00247] Compositions of the invention comprise polynucleotide-mediated modalities. 
DNA or RNA encoding one or more of the peptides of the invention can be adrninistered 
to a patient. This approach is described, for instance, in Wolff et. al, Science 247:1465 
(1990) as well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 
5,679,647; and, WO 98/04720. Examples of DNA-based delivery technologies include 
"naked DNA", facilitated (bupivicaine, polymers {e.g., PVP, PINC, etc.), peptide- 
mediated) delivery, cationic lipid complexes, and particle-mediated ("gene gun") or 
pressure-mediated delivery (see, e.g., U.S. Patent No. 5,922,687). Accordingly, peptides of 
the invention can be expressed by viral or bacterial vectors. Examples of expression 
vectors include attenuated viral hosts, such as Modified Vaccinia Ankara (MVA) (e.g., 
Bavarian Noridic), vaccinia or fowlpox. For example, vaccinia virus is used as a vector to 
express nucleotide sequences that encode the peptides of the invention. Upon introduction 
into an acutely or chronically infected host or into a non-infected host, the recombinant 
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune 
response. Vaccinia vectors and methods useful in immunization protocols are described 
in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). 
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BCG vectors are described in Stover et al, Nature 351 .456-460 (1991). A wide variety of 
other vectors useful for therapeutic adrninistration or immunization of the peptides of the 
invention, e.g. adeno and adeno-associated virus vectors, alpha virus vectors, retroviral 
vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like, are 
apparent to those skilled in the art from the description herein. 
[00248] In certain embodiments, components that induce T cell responses are combined 
with components that induce antibody responses to the target antigen of interest. A 
preferred embodiment of such a composition comprises class I and class H epitopes in 
accordance with the invention. Alternatively, a composition comprises a class I and/or 
class H epitope in accordance with the invention, along with a PADRE® molecule 
(Epimmune, San Diego, CA). 
[00249] Compositions of the invention can comprise antigen presenting cells, such as 
dendritic cells. Antigen presenting cells, e.g., dendritic cells, may be transfected, e.g., with 
a polynucleotide such as a minigene construct in accordance with the invention, in order to 
elicit immune responses. The peptide can be bound to an HLA molecule on the antigen- 
resenting cell, whereby when an HLA-restricted cytotoxic T lymphocyte (CTL) is present, 
a receptor of the CTL binds to a complex of the HLA molecule and the peptide. 
[00250] The compositions of the invention may also comprise antiviral drugs such as 

interferon-oc, or immune adjuvants such as IL-12, GM-CSF, etc. 
[00251] Compositions may comprise an HLA heavy chain, p 2 -microglobulin, streptavidin, 

and/or biotin. The streptavidin may be fluorescently labeled. Compositions may comprise 

tetramers (see e.g., U.S. Pat. No. 5,635,363; Science 274:94-96 (1996)). A tetramer 

composition comprising an HLA heavy chain, 0 2 -rmcroglobuIin, streptavidin, and biotin. 

The streptavidin may be fluorescently labeled. Compositions may also comprise aimers. 

A dimer composition comprises as MHC molecule and an Ig molecule (see e.g., PNAS 

95:7568-73 (1998)). 

[00252] In some embodiments it may be desirable to include in the compositions of the 
invention at least one component which primes cytotoxic T lymphocytes. Lipids have 
been identified as agents capable of priming CTL in vivo against viral antigens. For 
example, palmitic acid residues can be attached to the s-and a- amino groups of a lysine 
residue and then linked, e.g, via one or more linking residues such as Gly, Gly-Gly-, Ser, 
Ser-Ser, or the like, to an immunogenic peptide. The lipidated peptide can then be 
administered either directly in a micelle or particle, incorporated into a liposome, or 
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emulsified in an adjuvant, e.g., incomplete Freund's adjuvant A preferred composition 
comprises palmitic acid attached to e- and a- amino groups of Lys, which is attached via 
linkage, e.g., Ser-Ser, to the amino terminus of the peptide. 
[00253] As another example of lipid priming of CTL responses, E. coli lipoproteins, such 
as tripalnfitoyl-S-glycerylcysteimyseryl-serine (P 3 CSS) can be used to prime virus specific 
CTL when covalently attached to an appropriate peptide {see, e.g., Deres, et al, Nature 
342:561, 1989). Peptides of the invention can be coupled to P 3 CSS, for example, and the 
lipopeptide administered to an individual to specifically prime a CTL response to the 
target antigen. Moreover, because the induction of neutralizing antibodies can also be 
primed with P 3 CSS-conjugated epitopes, two such compositions can be combined to more 
effectively elicit both humoral and cell-mediated responses. 
[00254] Another preferred embodiment is a composition comprising one or more peptides 

of the invention emulsified in IFA. 
[00255] Compositions of the invention may also comprise CTL and/or HTL peptides. Such 
CTL and HTL peptides can be modified by the addition of amino acids to the termini of a 
peptide to provide for ease of linking peptides one to another, for coupling to a carrier 
support or larger peptide, for modifying the physical or chemical properties of the peptide 
or oligopeptide, or the like. Amino acids such as tyrosine, cysteine, lysine, glutamic or 
aspartic acid, or naturally or unnaturally occuring amino acid residues, can be introduced 
at the carboxyl- or ammo-terminus of the peptide or oligopeptide, particularly class I 
peptides. However, it is to be noted that modification at the carboxyl terminus of a CTL 
epitope may, in some cases, alter binding characteristics of the peptide. In addition, the 
peptide or oligopeptide sequences can differ from the natural sequence by being modified 
by terminal-NH 2 acylation, e.g., by alkanoyl ( Cl -C 20 ) or thioglycolyl acetylation, terminal- 
carboxyl amidation, e.g., ammonia, methylamine, etc. In some instances these 
modifications may provide sites for linking to a support or other molecule. CTL and HTL 
epitopes may comprise additional amino acids, such as those described above including 
spacers. 

[00256] A further embodiment of a composition in accordance with the invention is an 
antigen presenting cell that comprises one or more peptides in accordance with the 
invention. The antigen presenting cell can be a "professional" antigen presenting cell, 
such as a dendritic cell. The antigen presenting cell can comprise the peptide of the 
invention by any means known or to be determined in the art. Such means include pulsing 
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of dendritic cells with one or more individual peptides, by nucleic acid administration such 
as ballistic nucleic acid delivery or by other techniques in the art for administration of 
nucleic acids, including vector-based, e.g. viral vector, delivery of nucleic acids. 
[00257] Compositions may comprise carriers. Carriers that can be used with compositions 
of the invention are well known in the art, and include, e.g., thyroglobulin, albumins such 
as human serum albumin, tetanus toxoid, polyamino acids such as poly L-lysine, poly L- 
glutamic acid, influenza virus proteins, hepatitis B virus core protein, and the like. 
[00258] The compositions (e.g. pharmaceutical compositions) can contain a physiologically ' 
tolerable diluent such as water, or a saline solution, preferably phosphate buffered saline. 
Additionally, as disclosed herein, CTL responses can be primed by conjugating peptides 
of the invention to lipids, such as tripalnutoyl-S-glyceryl-cysteinyl-seryl-serine (P 3 CSS). 
[00259] Compositions of the invention may be pharmaceutically acceptable compositions. 
Pharmaceutical compositions preferably, contain an immunologically effective amount of 
one or more peptides and/or polynucleotides of the invention, and optionally one or more 
other components which are pharmaceutically acceptable. A preferred composition 
comprises one or more peptides of the invention and IF A. A more preferred composition 
of the invention comprises one or more peptides of the invention, one or more peptides, 
and IF A. 

[00260] Upon immunization with a peptide and/or polynucleotide and/or composition in 
accordance with the invention, via injection (e.g., SC, ID, IM), aerosol, oral, transdermal, 
transmucosal, intrapleural, intrathecal, or other suitable routes, the immune system of the 
host responds to the vaccine by an immune response comprising the production of 
antibodies, CTLs and/or HTLs specific for the desired antigen(s). Consequently, the host 
becomes at least partially immune to subsequent exposure to the infectious agent(s), or at 
least partially resistant to further development of infectious agent-bearing cells and 
thereby derives a prophylactic or therapeutic benefit. 

[00261] Furthermore, the peptides, primers, and epitopes of the invention can be used in 
any desired immunization or administration regimen; e.g., as part of periodic vaccinations 
such as annual vaccinations as in the veterinary arts or as in periodic vaccinations as in the 
human medical arts, or as in a prime-boost regime wherein an inventive vector or 
recombinant is administered either before or after the administration of the same or of a 
different epitope of interest or recombinant or vector expressing such as a same or 
different epitope of interest (including an inventive recombinant or vector expressing such 
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as a same or different epitope of interest), see, e.g., U.S. Pat. Nos. 5,997,878; 6,130,066; 
6,180,398; 6,267,965; and 6,348,450. An useful viral vector of the present invention is 
Modified Vaccinia Ankara (MVA) (e.g., Bavarian Noridic (MVA-BN)). 
[00262] Recent studies have indicated that a prime-boost protocol, whereby immunization 
with a poxvirus recombinant expressing a foreign gene product is followed by a boost 
using a purified subunit preparation form of that gene product, elicits an enhanced immune 
response relative to the response elicited with either product alone. Human volunteers 
immunized with a vaccinia recombinant expressing the HTV-1 envelope glycoprotein and 
boosted with purified HTV-1 envelope glycoprotein subunit preparation exhibit higher 
HTV-1 neutralizing antibody titers than individuals immunized with just the vaccinia 
recombinant or purified envelope glycoprotein alone (Graham et al, J. Infect. Dis., 
167:533-537 (1993); Cooney et al, Proc. Natl. Acad. Sci. USA, 90:1882-1886 (1993)). 
Humans immunized with two injections of an ALVAC-HTV-1 env recombinant (vCP125) 
failed to develop HTV specific antibodies. Boosting with purified rgpl60 from a vaccinia 
virus recombinant resulted in detectable HTV-1 neutralizing antibodies. Furthermore, 
specific lymphocyte T cell proliferation to rgpl60 was clearly increased by the boost with 
rgpl60. Envelope specific cytotoxic lymphocyte activity was also detected with this 
vaccination regimen (Pialoux et al, AIDS Res. and Hum. Retroviruses, 11:272-381 
(1995)). Macaques immunized with a vaccinia recombinant expressing the simian 
immunodeficiency virus (STV) envelope glycoprotein and boosted with STV envelope 
glycoprotein from a baculovirus recombinant are protected against STV challenge (Hu et 
al, AID Res. and Hum. Retroviruses, 3:615-620 (1991); Hu et al, Science 255:456-459 
(1992)). In the same fashion, purified HCMVgB protein can be used in prime-boost 
protocols with NYVAC or ALVAC-gB recombinants. 
[00263] In certain embodiments, the polynucleotides are complexed in a liposome 
preparation. Liposomal preparations for use in the instant invention include cationic 
(positively charged), anionic (negatively charged) and neutral preparations. However, 
cationic liposomes are particularly preferred because a tight charge complex can be 
formed between the cationic liposome and Ihe polyanionic nucleic acid. Cationic 
liposomes have been shown to mediate intracellular delivery of plasmid DNA (Feigner et 
al, Proc. Natl Acad. Sci. USA 5*74137416 (1987), which is herein incorporated by 
reference); mRNA (Malone et al, Proc. Natl. Acad. Sci. USA 55:60776081 (1989), which 
is herein incorporated by reference); and purified transcription factors (Debs et al, J. Biol. 
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Chem. 255:1018910192 (1990), which is herein incorporated by reference), in functional 
form. 

[00264] Cationic liposomes are readily available. For example, N-[12,3-dioleyloxy)- 
propyl]-N^M,N-triethylammonium (DOTMA) liposomes are particularly useful and are 
available under the trademark Lipofectin, from GIBCO BRL, Grand Island, N.Y. (See, 
also, Feigner et al, Proc. Natl Acad. Sci. USA £4:74137416 (1987)). Other commercially 
available liposomes include transfectace (DDAB/DOPE) and DOTAP/DOPE 
(Boehringer). 

[00265] Other cationic liposomes can be prepared from readily available materials using 
techniques well known in the art. See, e.g. PCT Publication No. WO 90/11092 for a 
description of the synthesis of DOTAP (l,2-bis(oleoyloxy)-3-(trimethylammonio)propane) 
liposomes. Preparation of DOTMA liposomes is explained in the literature, see, e.g., P. 
Feigner et al, Proc. Natl. Acad. Sci. USA W:74137417. Similar methods can be used to 
prepare liposomes from other cationic lipid materials, 

[00266] Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, Ala.), or can be easily prepared using readily available 
materials. Such materials include phosphatidyl, choline, cholesterol, phosphatidyl 
ethanolamine, dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol 
(DOPG), dioleoylphoshatidyl ethanolamine (DOPE), among others. These materials can 
also be mixed with the DOTMA and DOTAP starting materials in appropriate ratios. 
Methods for making liposomes using these materials are well known in the art. 

[00267] For example, commercially available dioleoylphosphatidyl choline (DOPC), 
dioleoylphosphatidyl glycerol (DOPG), and dioleoylphosphatidyl ethanolamine (DOPE) 
can be used in various combinations to make conventional liposomes, with or without the 
addition of cholesterol. Thus, for example, DOPG/DOPC vesicles can be prepared by 
drying 50 mg each of DOPG and DOPC under a stream of nitrogen gas into a sonication 
vial. The sample is placed under a vacuum pump overnight and is hydrated the following 
day with deionized water. The sample is then sonicated for 2 hours in a capped vial, using 
a Heat Systems model 350 sonicator equipped with an inverted cup (bath type) probe at 
the maximum setting while the bath is circulated at 15EC. Alternatively, negatively 
charged vesicles can be prepared without sonication to produce multilamellar vesicles or 
by extrusion through nucleopore membranes to produce unilamellar vesicles of discrete 
size. Other methods are known and available to those of skill in the art. 
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[00268] The liposomes can comprise multilamellar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs), with SUVs being preferred. The 
various liposome nucleic acid complexes are prepared using methods well known in the 
art. See, e.g., Straubinger et al, Methods of Immunology 101:512527 (1983). For 
example, MLVs containing nucleic acid can be prepared by depositing a thin film of 
phospholipid on the walls of a glass tube and subsequently hydrating with a solution of the 
material to be encapsulated. SUVs are prepared by extended sonication of MLVs to 
produce a homogeneous population of unilamellar liposomes. The material to be 
entrapped is added to a suspension of preformed MLVs and then sonicated. When using 
liposomes containing cationic lipids, the dried lipid film is resuspended in an appropriate 
solution such as sterile water or an isotonic buffer solution such as 10 mM Tris/NaCl, 
sonicated, and then the preformed liposomes are mixed directly with the DNA. The 
liposome and DNA form a very stable complex due to binding of the positively charged 
liposomes to the cationic DNA. SUVs find use with small nucleic acid fragments. LUVs 
are prepared by a number of methods, well known in the art. Commonly used methods 
include Ca 2+ -EDTA chelation (Papahadjopoulos et al 9 Biochim. Biophys. Acta 394:483 
(1975); Wilson et al y Cell 17:11 (1979)); ether injection (Deamer, D. and Bangham, A.„ 
Biochim, Biophys. Acta 443:629 (1976); Ostro et al, Biochem. Biophys. Res. Commun. 
76:836 (1977); Fraley et al, Proc. Natl. Acad. Set USA 76:3348 (1979)); detergent 
dialysis (Enoch, H. and Strittmatter, P., Proc. Natl. Acad. Sci. USA 76:145 (1979)); and 
reversephase evaporation (REV) (Fraley et al, J. Biol Chem. 255:10431 (1980); Szoka, F. 
and Papahadjopoulos, D., Proc. Natl Acad. Sci. USA 75:145 (1978); SchaeferRidder et 
al, Science 215:166 (1982)). 

[00269] Generally, the ratio of DNA to liposomes will be from about 10:1 to about 1:10. 

Preferably, the ration will be from about 5:1 to about 1:5. More preferably, the ration will 
be about 3 : 1 to about 1:3. Still more preferably, the ratio will be about 1:1. 

[00270] U.S. Patent No. 5,676,954 reports on the injection of genetic material, complexed 
with cationic liposome carriers, into mice. U.S. Patent Nos. 4,897,355, 4,946,787, 
5,049,386, 5,459,127, 5,589,466, 5,693,622, 5,580,859, 5,703,055, and international 
publication no. WO 94/9469 provide cationic lipids for use in transfecting DNA into cells 
and mammals. U.S. Patent Nos. 5,589,466, 5,693,622, 5,580,859, 5,703,055, and 
international publication no. WO 94/9469 provide methods for delivering DNA-cationic 
lipid complexes to mammals. 
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Binding Affinity of Variants for HLA Molecules 

[00271] As indicated herein, the large degree of HLA polymorphism is an important factor 
to be taken into account with the epitope-based approach to developing therapeutics and 
diagnostics. To address this factor, epitope selection encompassing identification of 
peptides capable of binding at high or intermediate affinity to multiple HLA molecules is 
preferably utilized, most preferably these epitopes bind at high or intermediate affinity to 
two or more allele-specific HLA molecules. However, in some embodiments, it is 
preferred that all epitopes in a given composition bind to the alleles of a single HLA 
supertype or a single HLA molecule. 

[00272] Variants of the invention preferably include those that have an IC 50 or binding 
affinity value for a class I HLA molecule(s) of 500 nM or better (i.e., the value is < 500 
nM). In certain embodiments of the invention, peptides of interest have an IC 50 or binding 
affinity value for a class I HLA molecule(s) of 200 nM or better. In certain embodiments 
of the invention, peptides of interest, such as Al and A24 peptides, have an IC 50 or 
binding affinity value for a class I HLA molecule(s) of 100 nM or better. If HTL epitopes 
are included, they preferably are HTL epitopes that have an IC 50 or binding affinity value 
for class II HLA molecules of 1000 nM or better, (i.e., the value is < 1,000 nM). For 
example, peptide binding is assessed by testing the capacity of a candidate peptide to bind 
to a purified HLA molecule in vitro. Peptides exhibiting high or intermediate affinity are 
then considered for further analysis. Selected peptides are generally tested on other 
members of the supertype family. In preferred embodiments, peptides that exhibit cross- 
reactive binding are men used in cellular screening analyses or vaccines. 

[00273] The relationship between binding affinity for HLA class I molecules and 
immunogenicity of discrete peptide epitopes on bound antigens was determined for the 
first time by inventors at Epimmune. As disclosed in greater detail herein, higher HLA 
binding affinity is correlated with greater immunogenicity. 

[00274] Greater immunogenicity can be manifested in several different ways. 
Immunogenicity corresponds to whether an immune response is elicited at all, and to the 
vigor of any particular response, as well as to the extent of a population in which a 
response is elicited. For example, a peptide might elicit an immune response in a diverse 
array of the population, yet in no instance produce a vigorous response. In accordance 
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with these principles, close to 90% of high binding peptides have been found to elicit a 
response and thus be "immunogenic," as contrasted with about 50% of the peptides that 
bind with intermediate affinity. (See, e.g., Schaeffer et al PNAS (1988)) High affinity- 
binding class I peptides generally have an affinity of less than or equal to 100 nM. 
Moreover, not only did peptides with higher binding affinity have an enhanced probability 
of generating an immune response, the generated response tended to be more vigorous 
than the response seen with weaker binding peptides. As a result, less peptide is required 
to elicit a similar biological effect if a high affinity binding peptide is used rather than a 
lower affinity one. Thus, in some preferred embodiments of the invention, high affinity 
binding epitopes are used. 

[00275] The correlation between binding affinity and immunogenicity was analyzed by the 
present inventors by two different experimental approaches (see, e.g., Sette, et al, J. 
Immunol. 153:5586-5592 (1994)). In the first approach, the immunogenicity of potential 
epitopes ranging in HLA binding affinity over a 10,000-fold range was analyzed in HLA- 
A*0201 transgenic mice. In the second approach, the antigenicity of approximately 100 
different hepatitis B virus (HBV)-derived potential epitopes, all carrying A*0201 binding 
motifs, was assessed by using PBL from acute hepatitis patients. Pursuant to these 
approaches, it was determined that an affinity threshold value of approximately 500 nM 
(preferably 50 nM or less) determines the capacity of a peptide epitope to elicit a CTL 
response. These data are true for class I binding affinity measurements for naturally 
processed peptides and for synthesized T cell epitopes. These data also indicate the 
important role of determinant selection in the shaping of T cell responses (see, e.g., 
Schaeffer et al Proc. Natl. Acad. Sci. USA 86:4649-4653 (1989)). 
[00276] An affinity threshold associated with immunogenicity in the context of HLA class 
n (i.e., HLA DR) molecules has also been delineated (see, e.g., Southwood et al J. 
Immunology 160:3363-3373 (1998), and U.S. Patent No. 6,413,527, issued July 2, 2002). 
m order to define a biologically significant threshold of HLA class H binding affinity, a 
database of the binding affinities of 32 DR-restricted epitopes for their restricting element 
(i.e., the HLA molecule that binds the epitope) was compiled. In approximately half of the 
cases (15 of 32 epitopes), DR restriction was associated with high binding affinities, 
i.e. binding affinity values of 100 nM or less. In the other half of the cases (16 of 32), DR 
restriction was associated with intermediate affinity (binding affinity values in the 
100-1000 nM range). In only one of 32 cases was DR restriction associated with an IC 50 
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of 1000 nM or greater. Thus, 1000 nM is defined as an affinity threshold associated with 
immunogenicity in the context of DR molecules. 
[00277] The binding affinity of peptides for HLA molecules can be determined as 
described in Example 1, below. 

Enhancing Population Coverage of the Vaccine 
[00278] The primary anchor residues of the HLA class I peptide epitope supermotifs and 
motifs are summarized in Tables 1-2. Allele-specific HLA molecules that are comprised 
by the various HLA class I supertypes are listed in Table 4. In some cases, patterns of 
amino acid residues are present in both a motif and a supermotif. The relationship of a 
particular motif and any related supermotif is indicated in the description of the individual 
motifs. 

[00279] By inclusion of one or more epitopes from several motifs or supermotifs in a 
vaccine composition, enhanced population coverage for majox global ethnicities can be 
obtained. 

Assays to Detect T-Cell Responses 
[00280] Once HLA binding peptides are identified, they can be tested for the ability to 

elicit a T-cell response. The preparation and evaluation of motif-bearing peptides are 
described, e.g., in PCT publications WO 94/20127 and WO 94/03205. Briefly, peptides 
comprising epitopes from a particular antigen are synthesized and tested for their ability to 
bind to relevant HLA proteins. These assays may involve evaluation of peptide binding to 
purified HLA class I molecules in relation to the binding of a radioiodinated reference 
peptide. Alternatively, cells expressing empty class I molecules (i.e. cell surface HLA 
molecules that lack any bound peptide) may be evaluated for peptide binding by 
immunofluorescent staining and flow microfluorimetry. Other assays that may be used to 
evaluate peptide binding include peptide-dependent class I assembly assays and/or the 
inhibition of CTL recognition by peptide competition. Those peptides that bind to an 
HLA class I molecule, typically with an affinity of 500 nM or less, are further evaluated 
for their ability to serve as targets for CTLs derived from infected or immunized 
individuals, as well as for their capacity to induce primary in vitro or in vivo CTL 
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responses that can give rise to CTL populations capable of reacting with selected target 
cells associated with pathology. 

[00281] Analogous assays are used for evaluation of HLA class II binding peptides. HLA 

class II motif-bearing peptides that are shown to bind, typically at an affinity of 1000 nM 
or less, are further evaluated for the ability to stimulate HTL responses. 

[00282] Conventional assays utilized to detect T cell responses include proliferation assays, 

lymphokine secretion assays, direct cytotoxicity assays, and limiting dilution assays. For 
example, antigen-presenting cells that have been incubated with a peptide can be assayed 
for the ability to induce CTL responses in responder cell populations. Antigen-presenting 
cells can be normal cells such as peripheral blood mononuclear cells or dendritic cells. 
Alternatively, mutant, non-human mammalian cell lines that have been transfected with a 
human class I MHC gene, and that are deficient in their ability to load class I molecules 
with internally processed peptides, are used to evaluate the capacity of the peptide to 
induce in vitro primary CTL responses. Peripheral blood mononuclear cells (PBMCs) can 
be used as the source of CTL precursors. Antigen presenting cells are incubated with 
peptide, after which the peptide-loaded antigen-presenting cells are then incubated with 
the responder cell population under optimized culture conditions. Positive CTL activation 
can be determined by assaying the culture for the presence of CTLs that lyse radio-labeled 
target cells, either specific peptide-pulsed targets or target cells that express endogenously 
processed antigen from which the specific peptide was derived. Alternatively, the 
presence of epitope-specific CTLs can be detennined by IFNy in situ ELISA. 

[00283] In an embodiment of the invention, directed to diagnostics, a method has been 
devised which allows direct quantification of antigen-specific T cells by staining with 
fluorescein-labelled HLA tetrameric complexes (Altman, J. D. et al 9 Proc. Natl. Acad. 
Sci. USA 90:10330, 1993; Altman, J. D. et al 9 Science 274:94, 1996). Other options 
include staining for intracellular lymphokines, and interferon release assays or ELISPOT 
assays. Tetramer staining, intracellular lymphokine staining and ELISPOT assays all 
appear to be at least 10-fold more sensitive than more conventional assays (Lalvani, A. et 
al. 9 J. Exp. Med. 186:859, 1997; Dunbar, P. R. et al 9 Cwt. Biol. 8:413, 1998; Murali- 
Krishna, K. et al. 9 Immunity 8:177, 1998). Additionally, DimerX technology can be used 
as a means of quantitation (see, e.g. 9 Science 274:94-99 (1996) and Proc. Natl. Acad. Sci. 
95:7568-73 (1998)). 
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[00284] HTL activation may also be assessed using techniques known to those in the art, 
such as T cell proliferation or lymphokine secretion (see, e.g. Alexander et al, Immunity 
1:751-761, 1994). 

[00285] Alternatively, immunization of HLA transgenic mice can be used to determine 
immunogenicity of peptide epitopes. Several transgenic mouse strains, e.g., mice with 
human A2.1, All (which can additionally be used to analyze HLA-A3 epitopes), and B7 
alleles have been characterized. Other transgenic mice strains (e.g., transgenic mice for 
HLA-A1 and A24) are being developed. Moreover, HLA-DR1 and HLA-DR3 mouse 
models have been developed. In accordance with principles in the art, additional 
transgenic mouse models with other HLA alleles are generated as necessary. 

[00286] Such mice can be immunized with peptides emulsified in Incomplete Freund's 
Adjuvant; thereafter any resulting T cells can be tested for their capacity to recognize 
target cells that have been peptide-pulsed or transfected with genes encoding the peptide 
of interest. CTL responses can be analyzed using cytotoxicity assays described above. 
Similarly, HTL responses can be analyzed using, e.g., T cell proliferation or lymphokine 
secretion assays. 

Minigenes 

[00287] A number of different approaches are available which allow simultaneous delivery 
of multiple epitopes. Nucleic acids encoding multiple epitopes are a useful embodiment 
of the invention; discrete peptide epitopes or polyepitopic peptides can be encoded. The 
epitopes to be included in a minigene are preferably selected according to the guidelines 
set forth in the previous section. Examples of amino acid sequences that can be included in 
a minigene include: HLA class I epitopes, HLA class H epitopes, a ubiquitination signal 
sequence, and/or a targeting sequence such as an endoplasmic reticulum (ER) signal 
sequence to facilitate movement of the resulting peptide into the endoplasmic reticulum. 
Examples of minigene constructs are shown in Tables 23-28. 

[00288] The use of multi-epitope minigenes is also described in, e.g., co-pending 
applications U.S.S.N. 09/311,784, 09/894,018, 60/419,973, 60/415,463; Ishioka et al, J. 
Immunol. 162:3915-3925, 1999; An, L. and Whitton, J. L, J. Virol. 71:2292, 1997; 
Thomson, S. A. et al, J. Immunol. 157:822, 1996; Whitton, J. L. et al, J. Virol. 67:348, 
1993; Hanke, R. et al, Vaccine 16:426, 1998. For example, a multi-epitope DNA plasmid 
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encoding nine dominant HLA-A*0201- and All-restricted CTL epitopes derived from the 
polymerase, envelope, and core proteins of HBV and human immunodeficiency virus 
(HIV), a PADRE® universal helper T cell (HTL) epitope, and an endoplasmic reticulum- 
translocating signal sequence has heen engineered. Immunization of HLA transgenic mice 
with this plasmid construct resulted in strong CTL induction responses against the nine 
CTL epitopes tested. This CTL response was similar to that observed with a lipopeptide 
of known immunogenicity in humans, and significantly greater than immunization using 
peptides in oil-based adjuvants. Moreover, the immunogenicity of DNA-encoded epitopes 
in vitro was also correlated with the in vitro responses of specific CTL lines against target 
cells transfected with the DNA plasmid. These data show that the minigene served: 1.) to 
generate a CTL response and 2.) to generate CTLs that recognized cells expressing the 
encoded epitopes. A similar approach can be used to develop minigenes encoding 
epitopes of an infectious agent. 

[00289] For .example, to create a DNA sequence encoding the selected epitopes (minigene) 
for expression in human cells, the amino acid sequences of the epitopes may be reverse 
translated. A human codon usage table can be used to guide the codon choice for each 
ammo acid. These epitope-encoding DNA sequences may be directly adjoined, so that 
when translated, a continuous peptide sequence is created. However, to optimize 
expression and/or immunogenicity, additional elements can be incorporated into the 
minigene design such as spacer amino acid residues between epitopes. HLA presentation 
of CTL and HTL epitopes maybe improved by including synthetic (e.g. poly-alanine) or 
naturaUy-occurring flanking sequences adjacent to the CTL or HTL epitopes; these larger 
peptides comprising the epitope(s) are within the scope of the invention. In one 
embodiment, spacer amino acid residues between one or more CTL and/or HTL epitopes 
are designed so as to minimiz e junctional epitopes that may result from the juxtaposition 
of 2 CTL and/or HTL epitopes. 

[00290] The minigene sequence may be converted to DNA by assembling oligonucleotides 
that encode the plus and minus strands of the minigene. Overlapping oligonucleotides 
(30-100 bases long) may be synthesized, phosphbrylated, purified and annealed under 
appropriate conditions using well known techniques. The ends of the oligonucleotides can 
be joined, for example, using T4 DNA ligase. This synthetic minigene, encoding the 
epitope peptide, can then be cloned into a desired expression vector. 
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[00291] Standard regulatory sequences well known to those of skill in the art are preferably 
included in the vector to ensure expression in the target cells. Several vector elements are 
desirable: a promoter with a downstream cloning site for minigene insertion; a 
polyadenylation signal for efficient transcription termination; an E. coli origin of 
replication; and an E. coli selectable marker (e.g. ampicillin or kanamycin resistance). 
Numerous promoters can be used for this purpose, e.g., the human cytomegalovirus 
(hCMV) CMV-IE promoter. See, e.g., U.S. Patent Nos. 5,580,859 and 5,589,466 for other 
suitable promoter sequences. 

[00292] Optimized peptide expression and immunogenicity can be achieved by certain 
modifications to a minigene construct. For example, in some cases introns facilitate 
efficient gene expression, thus one or more synthetic or naturafiy-occurring introns can be 
incorporated into the transcribed region of the minigene. The inclusion of mRNA 
stabilization sequences and sequences for replication in mammalian cells may also be 
considered for increasing minigene expression. 

[00293] Once an expression vector is selected, the minigene is cloned into the polylinker 
region downstream of the promoter. This plasmid is transformed into an appropriate 
bacterial strain, and DNA is prepared using standard techniques. The orientation and 
DNA sequence of the minigene, as well as all other elements included in the vector, are 
confirmed using restriction mapping, PCR and/or DNA sequence analysis. Bacterial cells 
harboring the correct plasmid can be stored as cell banks. 

[002941 Ih addition, immunostimulatory sequences QSSs or CpGs) appear to play a role in 
the immunogenicity of DNA vaccines. These sequences may be included in the vector, 
outside the minigene coding sequence to enhance immunogenicity. 

[00295] In some embodiments, a bi-cistronic expression vector which allows production of 
both the minigene-encoded epitopes and a second protein (e.g., one that modulates 
immunogenicity) can be used. Examples of proteins or polypeptides that, if co-expressed 
with epitopes, can enhance an i mmu ne response include cytokines (e.g., IL-2, EL- 12, 
GM-CSF), cytokine-inducing molecules (e.g.. LeIF), costimulatory molecules, or pan-DR 
binding proteins (PADRE®, Epimmune, San Diego, CA). Helper T cell (HTL) epitopes 
such as PADRE® molecules can be joined to intracellular targeting signals and expressed 
separately from expressed CTL epitopes. This can be done in order to direct HTL 
epitopes to a cell compartment different than that of the CTL epitopes, one that provides 
for more efficient entry of HTL epitopes into the HLA class H pathway, thereby 
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improving HTL induction. In contrast to HTL or CTL induction, specifically decreasing 
the immune response by co-expression of immunosuppressive molecules (e.g. TGF-p) 
may be beneficial in certain diseases. 

100296] Therapeutic quantities of plasmid DNA can be produced for example, by 
fermentation in E. coli, followed by purification. Aliquots from the working cell bank are 
used to inoculate growth medium, and are grown to saturation in shaker flasks or a 
bioreactor according to well known techniques. Plasmid DNA is purified using standard 
bioseparation technologies such as solid phase anion-exchange resins available, e.g., from 
QIAGEN, Inc. (Valencia, California). If required, supercoiled DNA can be isolated from 
the open circular and linear forms using gel electrophoresis or other methods. 

[00297] Purified plasmid DNA can be prepared for injection using a variety of 
formulations. The simplest of these is reconstitution of lyophilized DNA in sterile 
phosphate-buffer saline (PBS). This approach, known as "naked DNA," is currently being 
used for intramuscular (Evl) administration . in . clinical trials. To maximize the 
( immunotherapeutic effects of minigene vaccines, alternative methods of formulating 

purified plasmid DNA may be used. A variety of such methods have been described, and 
new techniques may become available. Cationic lipids, glycolipids, and fusogenic 
liposomes can also be used in the formulation (see, e.g., WO 93/24640; Mannino & 
Gould-Fogerite, BioTechniques 6(7): 682 (1988); U.S. Patent No. 5,279,833; WO 
91/06309; and Feigner, et al, Proc. Nat 'I Acad. Set USA 84:7413 (1987). In addition, 
peptides and compounds referred to collectively as protective, interactive, non-condensing 
compounds (PINC) can also be complexed to purified plasmid DNA to influence variables 
such as stability, intramuscular dispersion, or trafficking to specific organs or cell types. 

[00298] Known methods in the art can be used to enhance delivery and uptake of a 
polynucleotide in vivo. For example, the polynucleotide can be complexed to 
polyvinylpyrrolidone (PVP), to prolong the localized bioavailability of the polynucleotide, 
thereby enhancing uptake of the polynucleotide by the organisum (see e.g., U.S. Patent 
No. 6,040,295; BP 0 465 529; WO 98/17814). PVP is a polyamide that is known to form 
complexes with a wide variety of substances, and is chemically and physiologically inert. 

[00299] Target cell sensitization can be used as a functional assay of the expression and 
HLA class I presentation of niinigene-encoded epitopes. For example, the plasmid DNA 
is introduced into a mammalian cell line that is a suitable target for standard CTL 
chromium release assays. The transfection method used will be dependent on the final 
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formulation, electroporation can be used for "naked" DNA, whereas cationic lipids or 
DNA:PVP compositions allow direct in vitro transfection. A plasmid expressing green 
fluorescent protein (GFP) can be co-transfected to allow enrichment of transfected cells 
using fluorescence activated cell sorting (FACS). The transfected cells are then 
chromium-51 ( 51 Cr) labeled and used as targets for epitope-specific CTLs. Cytolysis of 
the target cells, detected by 51 Cr release, indicates both the production and HLA 
presentation of, nunigene-encoded CTL epitopes. Expression of HTL epitopes may be 
evaluated in an analogous manner using assays to assess HTL activity. 
[00300] In vivo immunogenicity is a second approach for functional testing of minigene 
DNA formulations. Transgenic mice expressing appropriate human HLA proteins are 
immunized with the DNA product. The dose and route of administration are formulation 
dependent (e.g., TM for DNA in PBS, intraperitoneal (IP) for lipid-complexed DNA). 
Eleven to twenty-one days after immunization, splenocytes are harvested and restimulated 
for one week in the presence of peptides encoding each epitope being tested. Thereafter, 
for CTLs, standard assays are conducted to determine if there is cytolysis of peptide- 
loaded, 51 Cr-labeled target cells. Once again, lysis of target cells that were exposed to 
epitopes corresponding to those in the minigene, demonstrates DNA vaccine function and 
induction of CTLs. Immunogenicity of HTL epitopes is evaluated in transgenic mice in an 
analogous manner. 

[00301] Alternatively, the nucleic acids can be administered using ballistic delivery as 
described, for instance, in U.S. Patent No. 5,204,253. Using this technique, particles 
comprised solely of DNA are administered. In a further alternative embodiment for 
ballistic delivery, DNA can be adhered to particles, such as gold particles. 

Vaccine Compositions 

[00302] Vaccines that contain an immunologically effective amount of one or more 
peptides or polynucleotides of the invention are a further embodiment of the invention. 
The peptides can be delivered by various means or formulations, all collectively referred 
to as "vaccine" compositions. Such vaccine compositions, and/or modes of 
administration, can include, for example, naked DNA, DNA formulated with PVP, DNA 
in cationic lipid formulations; lipopeptides (e.£.,Vitiello, A et al., J. Clin. Invest. 95:341, 
1995), DNA or peptides, encapsulated e.g., in poly(DL-lactide-co-glycoUde) ("PLG") 
microspheres (see, e.g., Eldridge, et al., Molec. Immunol. 28:287-294, 1991: Alonso et al., 
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Vaccine 12:299-306, 1994; Jones et al, Vaccine 13:675-681, 1995); peptide compositions 
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al., Nature 
344:873-875, 1990; Hu et al, Clin Exp Immunol. 113:235-243, 1998); multiple antigen 
peptide systems (MAPs) (see e.g., Tarn, J. P., Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413, 
1988; Tarn, J.P., J. Immunol. Methods 196:17-32, 1996); viral, bacterial, or, fungal 
delivery vectors (Perkus, M. E. et al, In: Concepts in vaccine development, Kaufmann, S. 
H. E., ed., p. 379, 1996; Chakrabarti, S. et al, Nature 320:535, 1986; Hu, S. L. et al, 
Nature 320:537, 1986; Kieny, M.-P. et al, AIDS Bio/Technology 4:790, 1986; Top, F. H. 
et al, J. Infect. Dis. 124:148, 1971; Chanda, P. K. et al, Virology 175:535, 1990); 
particles of viral or synthetic origin (e.g., Kofler, N. et al, J. Immunol. Methods. 192:25, 
1996; Eldridge, J. H. etal, Sem. Hematol. 30:16, 1993; Falo, L. D., Jr. et al, Nature Med. 
7:649, 1995); adjuvants (e.g., incomplete freund's advjuvant) (Warren, H. S., Vogel, F. R., 
and Chedid, L. A. Annu. Rev. Immunol. 4:369, 1986; Gupta, R. K. et al, Vaccine 11:293, 
1993); liposomes (Reddy, R. et al, J. Immunol. 148:1585, 1992; Rock, K. L., Immunol. 
Today 17:131, 1996); or, particle-absorbed DNA (Ulmer, J. B. et al, Science 259:1745, 
1993; Robinson, H. L., Hunt, L. A., and Webster, R. G, Vaccine 11:957, 1993; Shiver, J. 
W. et al, In: Concepts in vaccine development, Kaufmann, S. H. E., ed., p. 423, 1996; 
Cease, K. B., and Berzofsky, J. A., Annu. Rev. Immunol. 12:923, 1994 and Eldridge, J. H. 
et al, Sem. Hematol. 30:16, 1993), etc. Toxin-targeted delivery technologies, also known 
as receptor mediated targeting, such as those of Avant hnmunotherapeutics, Inc. 
(Needham, Massachusetts) or attached to a stress protein, e.g., HSP 96 (Stressgen 
Biotechnologies Corp., Victoria, BC, Canada) can also be used. 
[003031 Vaccines of the invention comprise nucleic acid mediated modalities. DNA or 
RNA encoding one or more of the peptides of the invention can be administered to a 
patient. This approach is described, for instance, in Wolff e/. al, Science 247:1465 (1990) 
as well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 
5,679,647; and, WO 98/04720. Examples of DNA-based delivery technologies include 
"naked DNA", facilitated (bupivicaine, polymers (e.g., PVP), peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated 
delivery (see, e.g., U.S. Patent No. 5,922,687). Accordingly, peptide vaccines of the 
invention can be expressed by viral or bacterial vectors. Examples of expression vectors 
include attenuated viral hosts, such as vaccinia or fowlpox. For example, vaccinia virus is 
used as a vector to express nucleotide sequences that encode the peptides of the invention 
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(e.g., MVA). Upon introduction into an acutely or chronically infected host or into a non- 
infected host, the recombinant vaccinia virus expresses the immunogenic peptide, and 
thereby elicits an immune response. Vaccinia vectors and methods useful in immunization 
protocols are described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG 
(Bacille Calmette Guerin). BCG vectors are described in Stover et al, Nature 351:456- 
460 (1991). A wide variety of other vectors useful for therapeutic administration or 
irnmunization of the peptides of the invention, e.g. adeno and adeno-associated virus 
vectors, alpha virus vectors, retroviral vectors, Salmonella typhi vectors, detoxified 
anthrax toxin vectors, and the like, are apparent to those skilled in the art from the 
description herein. 

[00304] Furthermore, vaccines in accordance with the invention can comprise one or more 
peptides of the invention. Accordingly, a peptide can be present in a vaccine individually; 
alternatively, the peptide can exist as a homopolymer comprising multiple copies of the 
same peptide, or as a heteropolymer of various peptides. Polymers have the advantage of 
increased probability for immunological reaction and, where different peptide epitopes are 
used to make up the polymer, the ability to induce antibodies and/or T cells that react with 
different antigenic determinants of the antigen targeted for an immune response. The 
composition may be a naturally occurring region of an antigen or can be prepared, e.g., 
recombinantly or by chemical synthesis. 
[00305] Carriers that can be used with vaccines of the invention are well known in the art, 
and include, e.g., thyroglobulin, alburnins such as human serum albumin, tetanus toxoid,' 
polyarnino acids such as poly L-lysine, poly L-glutamic acid, influenza virus proteins, 
hepatitis B virus core protein, and the like. The vaccines can contain a physiologically 
tolerable diluent such as water, or a saline solution, preferably phosphate buffered saline. 
Generally, the vaccines also include an adjuvant. Adjuvants such as incomplete Freund's 
adjuvant, aluminum phosphate, aluminum hydroxide, or alum are examples of materials 
well known in the art. Additionally, as disclosed herein, CTL responses can be primed by 
conjugating peptides of the invention to lipids, such as tripahnitoyl-S-glyceryl-cysteinyl- 
seryl-serine (P 3 CSS). 

[00306] Upon immunization with a peptide composition in accordance with the invention, 
via injection (e.g., SC, ID, JM), aerosol, oral, transdermal, transmucosal, intrapleural,' 
intrathecal, or other suitable routes, the immune system of the host responds to the vaccine 
by producing antibodies, CTLs and/or HTLs specific for the desired antigen. 
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Consequently, the host becomes at least partially immune to subsequent exposure to the 
infectious agent, and thereby derives a prophylactic or therapeutic benefit. 

[00307] In certain embodiments, components that induce T cell responses are combined 

with components that induce antibody responses to the target antigen of interest. A 
preferred embodiment of such a composition comprises class I and class II epitopes in 
accordance with the invention. Alternatively, a composition comprises a class I and/or 
class II epitope in accordance with the invention, along with a PADRE® molecule 
(Epimmune, San Diego, CA). 

[00308] Vaccines of the invention can comprise antigen presenting cells, such as dendritic 

cells, as a vehicle to present peptides of the invention. For example, dendritic cells are 
transfected, e.g., with a minigene construct in accordance with the invention, in order to 
elicit immune responses. Minigenes are discussed in greater detail in a following section. 
Vaccine compositions can be created in vitro, following dendritic cell mobilization and 
harvesting, whereby loading of dendritic cells occurs in vitro. 

[00309] The vaccine compositions of the invention may also be used in combination with 

antiviral drugs such as interferon-oc, or immune adjuvants such as IL-12, GM-CSF, etc. 

[00310] Preferably, the following principles are utilized when selecting epitope(s) and/or 
analogs for inclusion in a vaccine, either peptide-based or nucleic acid-based formulations. 
Exemplary variants that may be utilized in a vaccine to treat or prevent infectious agent- 
mediated disease are set out in Tables 6-9 and Figures 1A-4. Each of the following 
principles can be balanced in order to make the selection. When multiple epitopes are to 
be used in a vaccine, the epitopes may be, but need not be, contiguous in sequence in the 
native antigen from which the epitopes are derived. Such multiple epitotes can refer to the 
order of epitopes within a peptide, or to the selection of epitopes that come from the same 
reagion, for use in either individual peptides or in a multi-epitopic peptide. 

1. ) Variants are selected which, upon administration, mimic immune responses that 
have been observed to be correlated with prevention or clearance of infectious disease. For HLA 
Class I, this generally includes 3-7 variants from at least one infectious agent or antigen thereof. 

2. ) Variants are selected that have the requisite binding affinity established to be 
correlated with immunogenicity: for HLA Class I an IC 50 of 500 nM or less, or for Class II an IC 50 
of 1000 nM or less. For HLA Class I it is presently preferred to select a peptide having an IC 50 of 
200 nM or less, as this is believed to better correlate not only to induction of an immune response, 
but to in vitro tumor cell killing as well. For HLA Al and A24, it is especially preferred to select a 
peptide having an IC 50 of 100 nM or less. 
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3. ) Supermotif bearing-variants, or a sufficient array of aliele-specific motif-bearing 
variants, are selected to give broad population coverage. La general, it is preferable to have at least 
80% population coverage. A Monte Carlo analysis, a statistical evaluation known in the art, can 
be employed to assess the breadth of population coverage. 

4. ) Of particular relevance are "nested epitopes:" Nested epitopes occur where at 
least two epitopes overlap in a given peptide sequence. For example, a nested epitope can be a 
fragment of an antigen from a region that contains multiple epitopes that are overleapping, or one 
epitope that is completely encompassed by another, e.g. 9 A2 peptides MAGE3.159 and 
MAGE3.160 are nested epitopes. A peptide comprising 'transcendent nested epitopes*' is a 
peptide that has both HLA class I and HLA class II epitopes in it. When providing nested 
epitopes, it is preferable to provide a sequence that has the greatest number of epitopes per 
provided sequence. Preferably, one avoids providing a peptide that is any longer than the amino 
terminus of the amino terminal epitope and the carboxyl terminus of the carboxyl terminal epitope 
in the peptide. When providing a sequence comprising nested epitopes, it is important to evaluate 
the sequence in order to insure that it does not have pathological or other deleterious biological 
properties; this is particularly relevant for vaccines directed to infectious organisms. 

5. ) If a protein with multiple epitopes or a polynucleotide (e.g., minigene) is created, 
an objective is to generate the smallest peptide that encompasses the epitopes of interest. This 
principle is similar, if not the same as that employed when selecting a peptide comprising nested 
epitopes. However, with an artificial peptide comprising multipe epitopes, the size minimization 
objective is balanced against the need to integrate any spacer sequences between epitopes in the 
polyepitopic protein. Spacer amino acid residues can be introduced to avoid junctional epitopes 
(an epitope recognized by the immune system, not present in the target antigen, and only created 
by the man-made juxtaposition of epitopes), or to facilitate cleavage between epitopes and thereby 
enhance epitope presentation. Junctional epitopes are generally to be avoided because the 
recipient may generate an immune response to that non-native epitope. Of particular concern is a 
junctional epitope that is a "dominant epitope." A dominant epitope may lead to such a zealous 
response that immune responses to other epitopes are diminished or suppressed. 

[00311] The principles are the same, except junctional epitopes applies to the sequences 
suirounding the epitope. One must also take care with other sequences in construct to 
avoid immune response. 

T CELL PRIMING MATERIALS 

[00312] In some embodiments it may be desirable to include in the pharmaceutical 
compositions of the invention at least one component which primes cytotoxic T 
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lymphocytes. Lipids have been identified as agents capable of facilitating the priming in 
vitro CTL response against viral antigens. For example, palmitic acid residues can be 
attached to the s-and a- amino groups of a lysine residue and then linked to an 
immunogenic peptide. One or more linking moieties can be used such as Gly, Gly-Gly-, 
Ser, Ser-Ser, or the like. The lipidated peptide can then be administered directly in a 
micelle or particle, incorporated into a liposome, or emulsified in an adjuvant, e.g., 
incomplete Freund's adjuvant. A preferred immunogenic composition comprises palmitic 
acid attached to e- and a- amino groups of Lys via a linking moiety, e.g., Ser-Ser, added to 
the amino terminus of an immunogenic peptide. 
[00313] Jh another embodiment of lipid-facilitated priming of CTL responses, E. coli 
lipoproteins, such as tripalmitoyl-S-glyceryl-cysteinyl-seryl-serine (P 3 CSS) can be used to 
prime CTL when covalently attached to an appropriate peptide. {See, e.g., Deres, et al, 
Nature 342:561, 1989). Thus, peptides of the invention can be coupled to P 3 CSS, and the 
lipopeptide adniinistered to an individual to specifically prime a CTL response to the 
target antigen. Moreover, because the induction of neutralizing antibodies can also be 
primed with P 3 CSS-conjugated epitopes, two such compositions can be combined to elicit 
both humoral and cell-mediated responses. 

DENDRITIC CELLS PULSED WITH CTL AND/OR HTL PEPTIDES 

[00314] An embodiment of a vaccine composition in accordance with the invention 
comprises ex vivo administration of a cocktail of epitope-bearing peptides to PBMC, or 
isolated DC therefrom, from the patient's blood. A pharmaceutical to facilitate harvesting 
of DC can be used, such as Progenipoietin™ (Monsanto, St. Louis, MO) or GM-CSF/EL-4. 
After pulsing the DC with peptides and prior to reinfusion into patients, the DC are 
washed to remove unbound peptides. In this embodiment, a vaccine comprises peptide- 
pulsed DCs which present the pulsed peptide epitopes in HLA molecules on their surfaces. 

[00315] The DC can be pulsed ex vivo with a cocktail of peptides, some of which stimulate 
CTL responses to one or more antigens of interest, e.g., antigens from infectious agents 
such as HTV env, HTV pol, HTV gag, HIV vpu, HBV and/or the antigens in Tables 1 1-22, 
or otherwise described herein or know in the art. Optionally, a helper T cell (HTL) 
peptide such as PADRE®, can be included to facilitate the CTL response. Thus, a vaccine 
in accordance with the invention comprising epitopes from an infectious agent is used to 
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treat or prevent disease mediated by these agents in patients. A vaccine can be used prior 
to, during, or following other therapies including, for example, antibiotic therepy, anti- 
viral therapy (e.g., highly active antiretroviral therapy (HAART) in the case of HTV- 
AIDS), antibody therapy, cancer therapy, and adjunct thereapy, whereupon the vaccine 
provides descreased morbidity, increased disease free survival and overall survival in 
recipients. 

DIAGNOSTIC AND PROGNOSTIC USES 

[00316] In one embodiment of the invention, HLA class I and class H binding peptides can 
be used.as reagents to evaluate an immune response. Preferably, the following principles 
are utilized when selecting a variant(s) for diagnostic, prognostic and similar uses. 
Potential principles include having the binding affinities described earlier, and/or matching 
the HLA-motif/supermotif of a peptide with the HLA-type of a patient. 
[00317] The evaluated immune response can be induced by any immunogen. For example, 
the immunogen may result in the production of antigen-specific CTLs or HTLs that 
recognize the peptide epitope(s) employed as the reagent. Thus, a peptide of the invention 
may or may not be used as the immunogen. Assay systems that can be used for such 
analyses include tetramer-based protocols {e.g., DimerX technology {see, e.g., Science 
274:94-99 (1996) and Proc. Natl. Acad. Sci. 95:7568-73 (1998)), staining for intraceUular 
lymphokines, interferon release assays, or ELISPOT assays. 
[00318] For example, following exposure to a putative immunogen, a peptide of the 
invention can be used in a tetramer staining assay to assess peripheral blood mononuclear 
cells for the presence of any antigen-specific CTLs. The HLA-tetrameric complex is used 
to directly visualize antigen-specific CTLs and thereby determine the frequency of such 
antigen-specific CTLs in a sample of peripheral blood mononuclear cells {see, e.g, Ogg et 
al, Science 279:2103-2106, 1998; and Altman et ah, Science 174:94-96, 1996). 
[00319] A tetramer reagent comprising a peptide of the invention is generated as follows: 
A peptide that binds to an HLA molecule is refolded in the presence of the corresponding 
HLA heavy chain and p 2 -microglobulin to generate a trimolecular complex. The complex 
is biotinylated at the carboxyl terminal end of the HLA heavy chain, at a site that was 
previously engineered into the protein. Tetramer formation is then induced by adding 
streptavidin. When fluorescently labeled streptavidin is used, the tetrameric complex is 
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used to stain antigen-specific cells. The labeled cells are then readily identified, e.g., by 
flow cytometry. Such procedures are used for diagnostic or prognostic purposes; the cells 
identified by the procedure can be used for therapeutic purposes. 
[00320] Peptides of the invention are also used as reagents to evaluate immune recall 
responses, (see, e.g., Bertoni et al, J. Clin. Invest. 100:503-513, 1997 and Penna et al J. 
Exp. Med. 174:1565-1570, 1991.) For example, a PBMC sample from an individual 
expressing a disease-associated antigen (e.g. an antigen from an infectious agent) can be 
analyzed for the presence of antigen-specific CTLs or HTLs using specific peptides. A 
blood sample containing mononuclear cells may be evaluated by cultivating the PBMCs 
and stimulating the cells with a peptide of the invention. After an appropriate cultivation 
period, the expanded cell population may be analyzed, for example, for CTL or for HTL 
activity. 

[00321] Thus, the peptides can be used to evaluate the efficacy of a vaccine. PBMCs 
obtained from a patient vaccinated with an immunogen may be analyzed by methods such 
as those described herein. The patient is HLA typed, and peptide epitopes that are bound 
by the HLA molecule(s) present in that patient are selected for analysis. The 
immunogenicity of the vaccine is indicated by the presence of CTLs and/or HTLs directed 
to epitopes present in the vaccine. 

[00322] The peptides of the invention may also be used to make antibodies, using 
techniques well known in the art (see, e.g. Current Protocols in Immunology, 
Wiley/Greene, NY; and Antibodies A Laboratory Manual Harlow, Harlow and Lane, Cold 
Spring Harbor Laboratory Press, 1989). Such antibodies are useful as reagents to 
determine the presence of disease-associated antigens. Antibodies in this category include 
those that recognize a peptide when bound by an HLA molecule, Le., antibodies that bind 
to a peptide-MHC complex. 

ADMINISTRATION FOR THERAPEUTIC OR PROPHYLACTIC PURPOSES 

[00323] The peptides and polynucleotides of the present invention, including cells and 

compositions comprising them, are useful for adrnimstration to mammals, particularly 
humans, to treat and/or prevent infection by an infectious agent such as HIV, HB V, HCV, 
HPV, Plasmodium falciparum and other agents described herein or known in me art' 
Vaccine compositions containing the peptides of the invention are administered to a 
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patient infected with a particular infectious agent or to an individual susceptible to, or 
otherwise at risk for, infection with such an agent to elicit an immune response against 
antigens of that agent and thus enhance the patient's own immune response capabilities. 
Where susceptible individuals are identified prior to infection, the composition can be 
targeted to them, thus minimizing the need for administration to a larger population. 
[00324J In therapeutic applications, peptide and/or nucleic acid compositions are administered to a 

patient in an amount sufficient to elicit an effective immune response to the infectious agent 
antigen and to thereby cure, arrest or slow symptoms and/or complications. An amount adequate 
to accomplish this is defined as "therapeutically effective dose." Amounts effective for this use 
will depend on, e.g., the particular composition administered, the manner of administration, the 
stage and severity of the disease being treated, the weight and general state of health of the patient, 
and the judgment of the prescribing physician. 
[00325] The vaccine compositions of the invention can be used purely as prophylactic agents. 

Generally the dosage for an initial prophylactic immunization generally occurs in a unit dosage 
range where the lower value is about 1, 5, 50, 500, or 1000 ug of peptide and the higher value is 
about 10,000; 20,000; 30,000; or 50,000 ug of peptide. Dosage values for a human typically range 
from about 500 ug to about 50,000 ug of peptide per 70 kilogram patient. This is followed by 
boosting dosages of between about 1.0 ug to about 50,000 ug of peptide, administered at defined 
intervals from about four weeks to six months after the initial administration of vaccine. The 
immunogeniciry of the vaccine may be assessed by measuring the specific activity of CTL and 
HTL obtained from a sample of the patient's blood. 
[00326] As noted above, peptides comprising CTL and/or HTL epitopes of the invention induce 

immune responses when presented by HLA molecules and contacted with a CTL or HTL specific 
for an epitope comprised by the peptide. The manner in which the peptide is contacted with the 
CTL or HTL is not critical to the invention. For instance, the peptide can be contacted with the 
CTL or HTL either in vitro or in vivo. If the contacting occurs in vivo, peptide can be administered 
directly, or in other forms/vehicles, e.g., DNA vectors encoding one or more peptides, viral vectors 
encoding the peptide(s), liposomes, antigen presenting cells such as dendritic cells, and the like. 
[00327] Accordingly, for pharmaceutical compositions of the invention in the form of peptides or 

polypeptides, the peptides or polypeptides can be administered directly. Alternatively, the 
peptide/polypeptides can be administered indirectly presented on APCs, or as DNA encoding 
them Furthermore, the peptides or DNA encoding them can be administered individually or as 
fusions of one or more peptide sequences. 
[00328] For therapeutic use, administration should generally begin at the first diagnosis of 

infectious agent-related disease. This is followed by boosting doses at least until symptoms are 



82 



WO 2005/012502 



PCT/US2004/009510 



substantially abated and for a period thereafter. In chronic disease states, loading doses followed 
by boosting doses may be required. 
[00329] The dosage for an initial therapeutic immunization generally occurs in a unit dosage range 

where the lower value is about 1, 5, 50, 500, or 1,000 ug of peptide and the higher value is about 
10,000; 20,000; 30,000; or 50,000 ug of peptide. Dosage values for a human typically range from 
about 500 ug to about 50,000 ug of peptide per 70 kilogram patient. Boosting dosages of between 
about 1.0 ug to about 50,000 ug of peptide, administered pursuant to a boosting regimen over 
weeks to months, can be administered depending upon the patient's response and condition. 
Patient response can be determined by measuring the specific activity of CTL and HTL obtained 
from the patient's blood. 

i 

[00330] In certain embodiments, peptides and compositions of the present invention are used in 

serious disease states. In such cases, as a result of the minimal amounts of extraneous substances 
and the relative nontoxic nature of the peptides, it is possible and may be desirable to administer 
substantial excesses of these peptide compositions relative to these stated dosage amounts. 

[00331] For treatment of chronic disease, a representative dose is in the range disclosed above, 

namely where the lower value is about 1, 5, 50, 500, or 1,000 ug of peptide and the higher value is 
about 10,000; 20,000; 30,000; or 50,000 ug of peptide, preferably from about 500 ug to about 
50,000 ug of peptide per 70 kilogram patient. Initial doses followed by boosting doses at 
established intervals, e.g., from four weeks to six months, may be required, possibly for a 
prolonged period of time to effectively immunize an individual. In the case of chronic disease, 
administration should continue until at least clinical symptoms or laboratory tests indicate that the 
disease has been eliminated or substantially abated, and for a follow-up period thereafter. The 
dosages, routes of administration, and dose schedules are adjusted in accordance with 
methodologies known in the art. 

[00332] The pharmaceutical compositions for therapeutic treatment are intended for parenteral, 

topical, oral, intrathecal, or local administration. Preferably, the pharmaceutical compositions are 
administered parentally, e.g., intravenously, subcutaneously, imradermally, or intramuscularly. 

[00333] Thus, in a preferred embodiment the invention provides compositions for parenteral 

administration which comprise a solution of the immunogenic peptides dissolved or suspended in 
an acceptable carrier, preferably an aqueous carrier. A variety of aqueous carriers may be used, 
e.g., water, buffered water, 0.8% saline, 0.3% glycine, hyaluronic acid and the like. These 
compositions may be sterilized by conventional, well known sterilization techniques, or may be 
sterile filtered. The resulting aqueous solutions may be packaged for use as is, or lyopbilized, the 
lyophilized preparation being combined with a sterile solution prior to administration. The 
compositions may contain pharmaceutically acceptable auxiliary substances or pharmaceutical 
excipients as may be required to approximate physiological conditions, such as pH-adjusting and 
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buffering agents, tonicity adjusting agents, wetting agents, preservatives, and the like, for example, 
sodium acetate, sodium lactate, sodium chloride, potassium chloride, calcium chloride, sorbitan 
monolaurate, triethanolamine oleate, etc. 

[00334] The concentration of peptides of the invention in the pharmaceutical formulations can vary 

widely, i.e., from less than about 0.1%, usually at or at least'about 2% to as much as 20% to 50% 
or more by weight, and will be selected primarily by fluid volumes, viscosities, etc., in accordance 
with the particular mode of aciministration selected. 

[00335] A human unit dose form of the peptide composition is typically included in a 

pharmaceutical composition that also comprises a human unit dose of an acceptable carrier, 
preferably an aqueous carrier, and is administered in a volume of fluid that is known by those of 
skill in the art to be used for administration of such compositions to humans {see, e.g., 
Remington's Pharmaceutical Sciences, 17 th Edition, A. Gennaro, Editor, Mack Publishing Co., 
Easton, Pennsylvania, 1985). 

[00336] The peptides of the invention can also be administered via liposomes, which serve to 

target the peptides to a particular tissue, such as lymphoid tissue, or to target selectively to infected 
cells, as well as to increase the half-life of the peptide composition. Liposomes include emulsions, 
foams, micelles, insoluble monolayers, liquid crystals, phospholipid dispersions, lamellar layers 
and the like. In these preparations, the peptide to be delivered is incorporated as part of a 
liposome, alone or in conjunction with a molecule which binds to a receptor prevalent among 
lymphoid cells (such as monoclonal antibodies which bind to the CD45 antigen) or with other 
therapeutic or immunogenic compositions. Thus, liposomes either filled or decorated with a 
desired peptide of the invention can be directed to the site of lymphoid cells, where the liposomes 
then deliver the peptide compositions. Liposomes for use in accordance with the invention are 
formed from standard vesicle-forming lipids, which generally include neutral and negatively 
charged phospholipids and a sterol, such as cholesterol. The selection of lipids is generally guided 
by consideration of, e.g., liposome size, acid lability and stability of the liposomes in the blood 
stream. A variety of methods are available for preparing liposomes, as described in, e.g., Szoka, et 
al, Ann. Rev. Biophys. Bioeng. 9:467 (1980), and U.S. Patent Nos. 4,235,871, 4,501,728, 
4,837,028, and 5,019,369. 

[00337] For targeting compositions of the invention to cells of the immune system, a ligand can be 

incorporated into the liposome, e.g., antibodies or fragments thereof specific for cell surface 
determinants of the desired immune system cells. A liposome suspension containing a peptide 
may be administered intravenously, locally, topically, etc. in a dose which varies according to, 
inter alia, the manner of administration, the peptide being delivered, and the stage of the disease 
being treated. 
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[00338] For solid compositions, conventional nontoxic solid carriers may be used which include, 

for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharin, talcum, cellulose, glucose, sucrose, magnesium carbonate, and the like. For oral 
administration, a pharmaceutically acceptable nontoxic composition is formed by incorporating 
any of the normally employed excipients, such as those carriers previously listed, and generally 
10-95% of active ingredient, that is, one or more peptides of the invention, often at a concentration 
of25%-75%. 

[00339] For aerosol adniinistration, the immunogenic peptides are preferably supplied in finely 

divided form, along with a surfactant and propellant. Typical percentages of peptides are 0.01%- 
20% by weight, often 1%-10%. The surfactant must, of course, be pharmaceutically acceptable, 
and preferably soluble in the propellant. Representative of such agents are the esters or partial 
esters of fatty acids containing from 6 to 22 carbon atoms, such as caproic, octanoic, lauric, 
palmitic, stearic, linoleic, linolenic, olesteric and oleic acids with an aliphatic polyhydric alcohol 
or its cyclic anhydride. Mixed esters, such as mixed or natural glycerides may be employed. The 
surfactant may constitute 0.1%-20% by weight of the composition, preferably 0.25-5%. The 
balance of the composition is ordinarily propellant, although an atomizer may be used in which no 
propellant is necessary and other percentages are adjusted accordingly. A carrier can also be 
included, e.g., lecilhin for intranasal delivery. 

[00340] Antigenic peptides of the invention have been used to elicit a CTL and/or HTL response 

ex vivo, as well. The resulting CTLs or HTLs can be used to treat chronic infections, or tumors in 
patients that do not respond to other conventional forms of therapy, or who do not respond to a 
therapeutic peptide or nucleic acid vaccine in accordance with the invention. Ex vivo CTL or HTL 
responses to a particular antigen (infectious or tumor-associated) are induced by incubating in 
tissue culture the patienfs, or genetically compatible, CTL or HTL precursor cells together with a 
source of antigen-presenting cells (APC), such as dendritic cells, and the appropriate immunogenic 
peptide. After an appropriate incubation time (typically about 7-28 days), in which the precursor 
cells are activated and expanded into effector cells, the cells are infused back into the patient, 
where they will destroy (CTL) or facilitate destruction (HTL) of their specific target cell (an 
infected cell or a tumor cell). 

KITS 

[00341] The peptide and nucleic acid compositions of this invention can be provided in kit 
form together with instructions for vaccine administration. Typically the kit would 
include desired compositions) of the invention in a container, preferably in unit dosage 
form and instructions for administration. For example , a kit would include an APC, such 
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as a dendritic cell, previously exposed to and now presenting peptides of the invention in a 
container, preferably in unit dosage form together with instructions for administration. An 
alternative kit would include a minigene construct with desired nucleic acids of the 
invention in a container, preferably in unit dosage form together with instructions for 
administration. Lymphokmes such as IL-2 or IL-12 may also be included in the kit. Other 
kit components that may also be desirable include, for example, a sterile syringe, booster 
dosages, and other desired excipients. 
{00342] The invention will be described in greater detail by way of specific examples. The 
following examples are offered for illustrative purposes, and are not intended to limit the 
invention in any manner. Those of skill in the art will readily recognize a variety of non- 
critical parameters that can be changed or modified to yield alternative embodiments in 
accordance with the invention. 

EXAMPLES 

EXAMPLE l. HLA CLASS I AND CLASS E BINDING ASSAYS 

[00343] The following example of peptide binding to HLA molecules demonstrates 
quantification of binding affinities of HLA class I and class H peptides. Binding assays 
can be performed with peptides that are either motif-bearing or not motif-bearing. 

[00344] Cell lysates were prepared and HLA molecules purified in accordance with 
disclosed protocols (Sidney et al., Current Protocols in Immunology 18.3.1 (1998); 
Sidney, et al, J. Immunol 154:247 (1995); Sette, et al, Mol Immunol 31:813 (1994)). 
The cell lines used as sources of HLA molecules and the antibodies used for the extraction 
of the HLA molecules from the cell lysates are also described in these publications and are 
well known in the art. 

[00345] Epstein-Barr virus (EBV)-transformed homozygous cell lines, fibroblasts, CIR, or 
721.221-transfectants were used as sources of HLA class I molecules. These cells were 
cultured in RPMI 1640 medium supplemented with 2mM I^glutamine (GD3CO, Grand 
Island, NY), 50uM 2-ME, 100ng/ml of streptomycin, lOOU/ml of penicillin (Irvine 
Scientific) and 10% heat-inactivated FCS (Irvine Scientific, Santa Ana, CA). 

[00346] Cell lysates were prepared as follows. Briefly, cells were lysed at a concentration 
of 10 8 cells/ml in 50 mM Tris-HCl, pH 8.5, containing 1% Nonidet P-40 (Fluka 
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Biochemika, Buchs, Switzerland), 150 mM NaCl, 5 mM EDTA, and 2 mM PMSF. 
Lysates were cleared of debris and nuclei by centrifugation at 15,000 x g for 30min. 

[00347] HLA molecules were purified from lysates by affinity chromatography. Lysates 
were passed twice through two pre-columns of inactivated Sepharose CL4-B and protein 
A-Sepharose. Next, the lysate was passed over a column of Sepharose CL-4B beads 
coupled to an appropriate antibody. The anti-HLA column was then washed with 10- 
column volumes of lOmM Tris-HCL, pH 8.0, in 1% NP-40, PBS, 2-column volumes of 
PBS, and 2-column volumes of PBS containing 0.4% n-octylglucoside. Finally, MHC 
molecules were eluted with 50mM diethylamine in 0.1 5M NaCl containing 0.4% n- 
octylglucoside, pH 11.5. A 1/25 volume of 2.0M Tris, pH 6.8, was added to the eluate to 
reduce the pH to -8.0. Eluates were then concentrated by centrifugation in Centriprep 30 
concentrators at 2000 rpm (Amicon, Beverly, MA). Protein content was evaluated by a 
BCA protein assay (Pierce Chemical Co., Rockford, IL) and confirmed by SDS-PAGE. 

[00348] A detailed description of the protocol utilized to measure the binding of peptides to 
Class I and Class H MHC has been published (Sette et al, Mol Immunol. 31:813, 1994; 
Sidney et al, in Current Protocols in Immunology, Margulies, Ed., John Wiley & Sons, 
New York, Section 18.3, 1998). Briefly, purified MHC molecules (5 to 500nM) were 
incubated with various unlabeled peptide inhibitors and l-10nM 125 I-radiolabeled probe 
peptides for 48h in PBS containing 0.05% Nonidet P-40 (NP40) (or 20% w/v digitonin for 
H-2 IA assays) in the presence of a protease inhibitor cocktail. The final concentrations of 
protease inhibitors (each from CalBioChem, La Jolla, CA) were 1 mM PMSF, 1.3 nM 
1.10 phenanthroline, 73 uM pepstatin A, 8mM EDTA 6mM N-ethylmaleimide (for Class 
H assays), and 200 pM N alpha-p-tosyl-L-lysine chloromethyl ketone (TLCK). All assays 
were performed at pH 7.0 with the exception of DRB1*0301, which was performed at pH 
4.5, and DRB1*1601 (DR2w2ip,) and DRB4*0101 (DRw53), which were performed at 
pH 5.0. pH was adjusted as described elsewhere {see Sidney et al, in Current Protocols 
in Immunology, Margulies, Ed., John Wiley & Sons, New York, Section 18.3, 1998). 
[00349] Following incubation, MHC-peptide complexes were separated from free peptide 
by gel filtration on 7.8 mm x 15 cm TSK200 columns (TosoHaas 16215, 
Montgomeryville, PA), eluted at 1.2 mls/min with PBS pH 6.5 containing 0.5% NP40 and 
0.1% NaN 3 . Because the large size of the radiolabeled peptide used for the DRB1*1501 
(DR2w2Pi) assay makes separation of bound from unbound peaks more difficult under 
these conditions, all DRB1*1501 (DR2w2p,) assays were performed using a 7.8mm x 
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30cm TSK2000 column eluted at 0.6 mls/min. The eluate from the TSK columns was 
passed through a Beckman 170 radioisotope detector, and radioactivity was plotted and 
integrated using a Hewlett-Packard 3396A integrator, and the fraction of peptide bound 
was determined. 

[00350] Radiolabeled peptides were iodinated using the cWoramine-T method. 
Representative radiolabeled probe peptides utilized in each assay, and its assay specific 
IC 50 nM, are known in the art. Typically, in preliminary experiments, each MHC 
preparation was titered in the presence of fixed amounts of radiolabeled peptides to 
determine the concentration of HLA molecules necessary to bind 10-20% of the total 
radioactivity. All subsequent inhibition and direct binding assays were performed using 
these HLA concentrations. 
[00351] Since under these conditions [label]<[HLA] and IC 50 >[HLA], the measured IC 50 
values are reasonable approximations of the true K D values. Peptide inhibitors are 
typically tested at concentrations ranging from 120 ug/ml to 1.2 ng/ml, and are tested in 
two to four completely independent experiments. To allow comparison of the data 
obtained in different experiments, a relative binding figure is calculated for each peptide 
by dividing the IC 50 of a positive control for inhibition by the IC 50 for each tested peptide 
(typically unlabeled versions of the radiolabeled probe peptide). For inter-experiment 
comparisons, relative binding values are compiled. These values can subsequently be 
converted back into IC 50 nM values by dividing the IC 50 nM of the positive controls for 
inhibition by the relative binding of the peptide of interest. This method of data 
compilation has proven to be the most accurate and consistent for comparing peptides that 
have been tested on different days, or with different lots of purified MHC. 
[00352] Because the antibody used for HLA-DR purification (LB3.1) is cc-chain specific, 
Pi molecules are not separated from p 3 (and/or p 4 and p 5 ) molecules. The p, specificity of 
the binding assay is obvious in the cases of DRB1*0101 (DR1), DRB1*0802 (DR8w2), 
and DRB1*0803 (DR8w3), where no p 3 is expressed. It has also been demonstrated for 
DRB1*0301 (DR3) and DRB3*0101 (DR52a), DRB1*0401 (DR4w4), DRB1*0404 
(DR4wl4), DRB1*0405 (DR4wl5), DRB1*1101 (DR5), DRB1*1201 (DR5wl2), 
DRB1-1302 (DR6wl9) and DRB1*0701 (DR7). The problem of p chain specificity for 
DRB1*1501 (DR2W2P0, DRB5*0101 (DR2w2p 2 ), DRB1*1601 (DR2w2ipO, 
DRB5*0201 (DR51DW21), and DRB4*0101 (DRw53) assays is circumvented by the usi 
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of fibroblasts. Development and validation of assays with regard to DR0 molecule 
specificity have been described previously (see, e.g., Southwood et al, J. Immunol. 
160:3363-3373, 1998). 

[00353] Binding assays as outlined above may be used to analyze supermotif and/or motif- 

bearing epitopes. 



EXAMPLE 2. RECOGNITION OF VARIANT PEPTIDES BY CTL DERIVED FROM 

DNA IMMUNIZATION 

[00354] Variants corresponding to five HLA-A2 and -A3 restricted epitopes from 167 HTV 
varianst were identified and synthesized. These represented all the complete sequences in 
the Los Alamos database at the time (116 strains), as well as 51 complete clade C 
sequences from Botswana, and included 22 subtype B and 62 subtype C sequences. These 
peptides were then characterized with regard to MHC binding, variant distribution, and 
immunogenicity. To measure immunogenicity, HLA-A2/K b or HLA-A1 1/K b transgenic 
mice were immunized with the epitopes encoded in a DNA based format 0- Eleven days 
after immunization, splenocytes were restimulated with either the epitope corresponding to 
the epitope encoded by the DNA (parent) or each of the variant peptides. After 6 days in 
culture, IFN-y secretion was measured in response to the peptide used to stimulate each 
culture. 

[00355] The data for these epitopes are shown in Figure 1 . The HLA-A2-restricted epitope 
corresponding to the Env 134 epitope (KLTPLCVTL; Figure 1 A) used as the immunogen 
was the form observed most often (134/167). All single anchor variants were recognized 
to approximately the same extent as the parent peptide. Many of the single non-anchor 
variants (9/13) were also recognized within 10-fold of the parent peptide. Conservative 
substitutions (R and Q for K; see Table 4) at position 1 (PI) were tolerated, while the non- 
conservative substitution (E for K; see Table 4) lowered binding and eliminated 
recognition. Three P4 variants were observed. Two of these (F or S for P) were recognized 
within 10-fold of the recognition of the parent peptide, while one substitution (Q for P) 
completely eliminated recognition. The binding for these peptides was not significantly 
different from the parent peptide, indicating that this residue may be involved in TCR 
recognition. Both the conservative (F for L) and non-conservative (R for L) substitutions 
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seen at P5 completely abrogated recognition, indicating that this residue is important in 
TCR recognition. Finally, one substitution at P8 (I for V), and four substitutions at P9 
show little effect on recognition. None of the variants with multiple substitutions were 
recognized, although this may be due to the poor binding of these peptides. 

[00356] The Gag 386 sequence utilized as the immunogen was the second most common 
form (VLAEAMSQV), present in 54 strains (Figure IB). The most prevalent variant, 
differing by a single tolerated C terminal anchor residue (V to A; 67 strains), was 
recognized equally to the parent epitope by CTL raised against the parent, as were the 
remaining single-anchor variants. Single substitutions were also tolerated at the non- 
anchor positions, PI (I for V) and P8 (R, K, or H for Q). Only the P7 variant (G for S), 
probably a TCR contact residue, was not recognized. 

[00357] Many of the multiple variants for Gag 386 were also recognized by CTL raised 
against the parent peptide. All the variants with multiple changes combined a change of V 
to A or T at the C terminus with 1-3 additional substitutions. Two variants with N terminal 
changes (V to A or I) were observed. The non-conservative A substitution was not 
recognized, while the conservative I substitution was. A double variant with a 
conservative substitution at P3 (A to G) was not recognized, implicating P3 in TCR 
recognition. Double variants with conservative changes at position 8 (Q to R, K, or H) 
were not well recognized, although the variants with single changes at the same positions 
were recognized. The variant combining a non-conservative A residue at position 8 with A 
at the C terminus was recognized as well as the parent. Equally surprising was the 
observation that all the variants with 3 or 4 substitutions were recognized within 10-fold of 
the parent peptide. 

[00358] The parent form of the HLA-A2-res1ricted epitope, Vpr 62 (RILQQLLFI; Figure 
1C) was the most common form observed (86/167). Seven well-tolerated single anchor 
substitutions, 4 P2 and 3 C terminal, were also observed, accounting for most of the 
remaining variants (47/167). Single substitutions were, in general, also well tolerated. The 
single exception was the non-conservative substitution (P for L) at P6, while an M for L 
substitution at the same site was well tolerated. Binding was not affected for either variant, 
indicating that the reduction in activity is due to a change in a contact residue. Most 
variants with multiple changes also showed recognition to approximately the same extent 
as the parent. Several variants however did show reduced recognition. The variant with 
changes at both anchors (I to T at P2 and I to T at P9) had reduced binding (IC 50 of 9700), 
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and recognition of the peptide was reduced, although not lost completely. Two variants 
with Q to H changes at P5, in combination with anchor residue changes (I to M at P2 and I 
to A at P9), exhibited greatly reduced recognition although binding was not affected. 
Other changes at P5 (Q to R or L at P5) reduced recognition only slightly. 

[00359] The HLA-A3/1 1-restricted epitope, Pol 98 (Figure ID), represented the most 
diverse epitope in terms of the number of variant epitopes identified. The peptide encoded 
in the DNA was represented in only 18 out of 167 strains. Approximately a third of the 
peptides identified at that position (49 out of 167) did not have recognizable A3/A11 
motifs. The most common variant (30 strains) differed from the parent peptide at 3 
residues (VS1KVGGQIK), but was recognized within 10-fold of the parent peptide. Two 
variants with conservative changes at anchor residues were both recognized, although the 
T to A substitution at P2 resulted in a 10-fold reduction in recognition of the variant 
peptide. All peptides with single changes in non-anchor positions were also recognized, 
although the P5 variant (G to E) exhibited a decrease in recognition. As the binding was 
not affected, this probably indicates involvement in T cell recognition. 

[00360] Peptides with two changes showed mixed results. In general, peptides with a V 
substitution at position 3, in combination with another substitution were recognized to the 
same extent as the corresponding single substitution, indicating the V substitution was 
tolerated well and is not a TCR contact residue. Combinations including the P2 anchor 
residue (T to A or N) were not recognized, although the binding of these peptides was also 
low. Variants with 3 substitutions were generally not recognized well. Two exceptions 
with very conservative substitutions were noted (Figure ID). CTL were unable to 
recognize peptides with four or more substitutions. 

[00361] The HLA-A3/11- restricted Env 47 epitope (Figure IE; VTVYYGVPVWK) was 
highly conserved, with only 9 variants identified. The most common form observed was 
the parent peptide (99 strains), while the second most common form, a single anchor 
substitution observed in 40 strains, was recognized to the same extent as the parent. All the 
variants were recognized within 10-fold of the parent epitope. 

[00362] Taken together, these data show trends towards promiscuous recognition of variant 
peptides by CTL generated from immunization with a single peptide. In general, changes 
that disrupted binding also decreased recognition. Recognition was also affected by the 
position of the change, with potential TCR contact residues (P3-7) exerting a greater effect 
on recognition than other residues. In general, conservative residue changes were more 
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widely tolerated than were non-conservative changes. Recognition was also dependent on 
the number of changes, with progressively lower recognition with a greater number of 
changes. 

[00363] Recognition after multiple restimulations The observed recognition of variant 
peptides by CTL raised against the parent peptide might be due to either promiscuous 
recognition at the level of a single TCR or simply a mixture of TCRs against the 
immunizing peptide which are each able to recognize subtly different peptides. To 
distinguish between these two possibilities, Env 134- or Gag 386-specific T cell lines were 
generated by stimulating five times with the immunizing peptide, and then tested for 
recognition of a partial panel of variant peptides. These T cell lines were also 
characterized for Vp TCR usage against a panel of antibodies predicted to react with the 
TCR of the mouse strains utilized for these experiments. 

[00364] The data for these peptide-specific lines are shown in Table 5. Because the SU is a 
measure of the number of cells needed to secrete a defined amount of IFN-y, a higher SU 
value would correspond to an enrichment of IFN-y producing cells. A comparison of one 
and five peptide stimulations indeed shows an enrichment of CTL specific for the 
immunizing peptide for both of the peptide lines generated (Table 5A and 5B, first line). 
The Gag 386 line (Table 5A) also demonstrated increased recognition of all the variant 
peptides measured except one peptide (ILAEAMSKA) that was never recognized. The 
Env 134 line also demonstrated enrichment for CTL able to recognize several of the 
variant peptides (Table 5B). 

[00365] To further characterize these lines, we examined them for Vp usage, utilizing a 
panel of commercially available antibodies available for mouse TCR Vp 2-14. To 
determine background levels for the various TCR Vp molecules, primary splenocytes from 
mice that had been immunized with EP HIV- 1090 were also examined. The results for the 
Gag 386 line are shown in Figure 2 A. After a single stimulation with the parent peptide, 
the Gag 386 line showed a mixture of TCR positive populations, including Vp 3, 5, and 
14. After 5 stimulations, those populations had been reduced to background levels, and 
approximately 50% of the CD8+ cells expressed the VP 6 TCR. The Env 134 line showed 
a similar pattern of multiple TCR positive populations after a single round of stimulation 
with reduction to background levels after 5 stimulations (data not shown). However, no 
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single V0 usage significantly above background could be demonstrated, probably due to 
lack of the relevant TCR V0 antibody. 

[00366] Both lines were also characterized with regard to the affinity of certain of the 
variant peptides by titrating the variant peptides examined above (Table 5A and 5B). The 
data for both the Gag 386 and Env 134 lines are shown in Figure 2B. For the Gag 386 line, 
the parent peptide along with two single anchor variants (VLAEAMSQI and 
VLAEAMSQA) showed the highest affinity. Four other peptides demonstrated lower 
affinity, but still produced IFN-y in response to higher peptide concentrations. A single 
peptide (ILAEAMSKA) was not recognized. 

[00367] As expected, the parent peptide, which was used to generate the Env 134 line, 
showed the highest affinity for the TCR. The other 2 variant peptides, KITPLCVTL and 
QLTPLCVTL, also demonstrated higher affinity, but reduced from the parent peptide by 
approximately 10-fold and 100-fold, respectively. It was notable that only at the highest 
peptide concentration examined (1 ug/ml) was any IFN-y secretion detected for five of the 
peptides (QITPLCVTL, ELTPLCVTL, KLTPFCVTL, KLTPLCVIL, and KLTPLCVPL). 
These five peptides showed little or no enrichment of CTL able to recognize them, and 
exhibited the lowest activity as measured by SU after five restimulations (see Table 5B). 

[00368] In summary, these cell lines 'seem to consist of a narrow, possibly single, TCR 
population. This TCR population recognizes the parent peptide with the highest affinity, 
but is also able to recognize a number of other variant peptides with equal or lesser 
affinity. 

[00369] Recognition of variant peptides by CTL derived from an HTV infected patient 
[00370] To determine if the same immunological conservation was observed in natural 
infections, we identified an HIV-infected individual expressing the HLA-A3 allele. The 
HTV strain and subtype with which this patient was infected is unknown. We had 
previously shown that T cells from this individual responded to the HLA-A3 restricted 
epitopes Pol 98 and Env 47. PBL from this patient were examined in an ELISPOT assay to 
determine if they also showed the capacity for broad cross-reactivity. The data are shown 
in Figure 3. Although the actual peptide represented in the HTV strain with which this 
individual is infected is unknown, we observed recognition of a large number of the 
variant peptides for both Pol 98 (Figure 3A) and Env 47 (Figure 3B). The recognition 
patterns were remarkably similar for the mouse and patient data (compare Figure 1 and 
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Figure 3), although the mouse expressed a transgene for HLA-A11 and the patient was 
HLA-A3. 

[00371] Prediction of Immunological Conservation. We had observed that the variant 
peptides that were recognized by CTL raised against the parent epitope had amino acid 
substitutions that followed previous observations. For example, the anchor residue 
changes that were tolerated in the variant peptides were also described as anchors that to 
define the respective HLA supertypes 0- In general, conservative substitutions were 
tolerated at non-anchor residues, while non-conservative substitutions were less well 
tolerated. These followed closely the prediction model used to identify heteroclitic analogs 
(Tangri et al). 

[00372] Based on these observations, we designed a computer program to predict 
immunological conservation. For anchor positions, this program utilized the conserved 
anchor residues described for the A2, A3, and B7 supertypes. For non-anchor positions 
only conservative substitutions, as defined in Tangri etal. 0> were allowed. All 
substitutions at non-anchor positions were analyzed independently and all conservative 
substitutions were allowed regardless of the number of substitutions. Finally, the position 
of the substitution was not factored into analysis. Each variant was compared with the 
parent epitope, and its ability to be recognized was predicted as either positive or negative. 

[00373] The first sets of epitopes to be evaluated by this program were the five HIV 
epitopes and variants previously described. For the Env 134 epitope, the program 
predicted that 13 of the variant peptides should be immunologically conserved, while 6 
should not be recognized. Comparison of the observed immunological data with the 
prediction showed that the program predicted correctly for 14 of the peptides and 
incorrectly for 5. Of the incorrect predictions, in two cases the program predicted negative 
results for peptides that were recognized, while in 3 cases the program predicted positive 
results for peptides that were not recognized. A similar analysis was performed for all five 
peptides. Of 101 total variant peptides, 68 were correctly identified (67%). The discordant 
data were fairly evenly split between peptides incorrectly predicted negative (15) and 
those incorrectly predicted positive (18). 

[00374] As noted previously, the more substitutions present in a variant peptide, the lower 
the likelihood of its immunogenicity. Since the prediction program treated all substitutions 
independently, and did not take into account the number of substitutions, we hypothesized 
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that prediction of single substitutions would be more accurate. Indeed, the immunogenicity 
of 38 of 47 single substitution variants (80%) was correctly predicted. 

[00375] With the limitations of the program in mind, it is useful to predict the recognition 
of the variants for a package of HLA-A2, -A3, and -B7 supertype epitopes. These epitopes 
had been identified as being well conserved in Clade B variants. When comparing the 
conservation of this group of epitopes based on sequence identity versus immunological 
conservation, it is interesting to note that the predicted recognition gains taking into 
account immunological conservation are significant (Table 6). 

[00376] This particular group of 21 epitopes was selected based on their identity 
conservation in Clade B HIV sequences, with conservation across HTV clades as a 
secondary consideration. Because of this criteria, the form of epitope chosen as the parent 
peptide was not the most common variant (e.g. Gag 386, Gag 271, Pol 98). In some cases 
(e.g., see Gag 386 data), the "parent" epitope and the most common variant were 
recognized to the same extent. However, in some cases the selection of epitope to include 
as the "parent" epitope was predicted to make a difference in the immunological 
conservation. An example of this was the Gag 271 epitope (Figure 4). The variant most 
commonly seen in clade B sequences was the MTNNPPIPV form, while the most 
common form of the epitope was MTSNPPIPV. Not all amino acids are considered equal 
to each other in their ability to substitute (Tangri). For example, asparagine (N) is 
considered a conservative substitution for serine (S), while the opposite substitution in 
only considered semi-conserved. When the program calculated immunological 
conservation using the MTNNPPIPV peptide as the parent peptide, only two variants were 
predicted to be immunogenic. However, when the immunological conservation was 
predicted using the MTSNPPIPV peptide, most of the variants were predicted to be 
recognized (Figure 4). This prediction was tested using HLA-A2 transgenic mice. The 
results show that if the MTSNPPIPV form of the peptide was utilized in vaccines, 
approximately 152 of 167 variants would be recognized, while if the MTNNPPIPV form 
of the epitope was utilized, only 39 of 167 variants would be recognized. This has 
important implications in epitope selection for vaccine development, and epitope 
performance can be predicted. 
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EXAMPLE 3. A PADRE® MOLECULE AS A HELPER EPITOPE FOR ENHANCEMENT OF 

CTL INDUCTION 

100377] There is increasing evidence that HTL activity is critical for the induction of long 
lasting CTL responses (Livingston et al. J. Immunol 162:3088-3095 (1999); Walter et al, 
New Engl. J. Med. 333:1038-1044 (1995); Hu et al, J. Exp. Med. 177:1681-1690 (1993)). 
Therefore, one or more peptides that bind to HLA class II molecules and stimulate HTLs 
can be used in accordance with the invention. Accordingly, a preferred embodiment of a 
vaccine includes a molecule from the PADRE® family of universal T helper cell epitopes 
(HTL) that target most DR molecules in a manner designed to stimulate helper T cells. 
For instance, a pan-DR-binding epitope peptide having the formula: aKXVAAZTLKAAa, 
where "X" is either cyclohexylalanine, phenylalanine, or tyrosine; "Z" is either 
tryptophan, tyrosine, histidine or asparagjne; and "a" is either D-alanine or L-alanine (SEQ 
ID NO:29), has been found to bind to most HLA-DR alleles, and to stimulate the response 
of T helper lymphocytes from most individuals, regardless of their HLA type. 

[00378] A particularly preferred PADRE® molecule is a synthetic peptide, 
aKXV AAWTLKAAa (a = D-alanine, X = cyclohexylalanine), containing non-natural 
amino acids, specifically engineered to maximize both HLA-DR binding capacity and 
induction of T cell immune responses. 

[00379] Alternative preferred PADRE® molecules are the peptides, aKFVAAWTLKAAa, 
aKYV AAWTLKAAa, aKFVAAYTLKAAa, aKXVAAYTLKAAa,. aKYVAAYTLKAAa, 
aKFV AAHTLKAAa, aKXV AAHTLKAAa, aKYVAAHTLKAAa, aKFVAANTLKAAa, 
aKXV AANTLKAAa, aKYVAANTLKAAa, AKXV AAWTLKAAA (SEQ ID NO:30), 
AKFVAAWTLKAAA (SEQ ID NO:31), AKYV AAWTLKAAA (SEQ ID NO:32), 
AKFVAAYTLKAAA (SEQ ID NO:33), AKXVAAYTLKAAA (SEQ ID NO:34), 
AKYVAAYTLKAAA (SEQ ID NO:35), AKFV AAHTLKAAA (SEQ ID NO:36), 
AKXV AAHTLKAAA (SEQ ID NO:37), AKYVAAHTLKAAA (SEQ ID NO:38), 
AKFVAANTLKAAA (SEQ ID NO:39), AKXVAANTLKAAA (SEQ ID NO:40), 
AKYVAANTLKAAA (SEQ ID NO:41) (a = D-alanine, X = cyclohexylalanine). 

[00380] In a preferred embodiment, the PADRE® peptide is amidated. For example, a 
particularly preferred amidated embodiment of a PADRE® molecule is conventionally 
written aKXV AA WTLKAAa-NH 2 . 
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[00381] Competitive inhibition assays with purified HLA-DR molecules demonstrated that 
the PADRE® molecule aKXV AAWTLKAAa-NH 2 binds with high or intermediate 
affinity (IC 50 £1,000 nM) to 15 out of 16 of the most prevalent HLA-DR molecules 
((Kawashima et al, Human Immunology 59:1-14 (1998); Alexander et al, Immunity 
1:751-761 (1994)). A comparison of the DR binding capacity of PADRE® and tetanus 
toxoid (TT) peptide 830-843, a "universal" epitope has been published (Panina-Bordignon 
et al, Eur. J. Immunology 19:2237-2242 (1989)). The TT 830-843 peptide bound to only 
seven of 16 DR molecules tested, while PADRE® bound 15 of 16. At least 1 of the 15 DR 
molecules that bind PADRE® is predicted to be present in >95% of all humans. 
Therefore, this PADRE® molecule is anticipated to induce an HTL response in virtually all 
patients, despite the extensive polymorphism of HLA-DR molecules in the human 
population. 

[00382] PADRE® has been specifically engineered for optimal immunogenicity for human 
T cells. Representative data from in vitro primary immunizations of normal human T cells 
with TT 830-843 antigen and the PADRE® molecule aKXVAAWTLKAAa-NH 2 are 
shown in Figure 1. Peripheral blood mononuclear cells (PBMC) from three normal donors 
were stimulated with the peptides in vitro. Following the third round of stimulation, it was 
observed that PADRE® generated significant primary T cell responses for all three donors 
as measured in a standard T cell proliferation assay. With the PADRE® peptide, the 
10,000 cpm proliferation level was generally reached with 10 to 100 ng/ml of antigen. In 
contrast, TT 830-843 antigen generated responses for only 2 out of 3 of the individuals 
tested. Responses approaching the 10,000 cpm range were reached with about 10,000 
ng/ml of antigen. In this respect, it was noted that PADRE® was, on a molar basis, about 
100-fold more potent than TT 830-843 antigen for activation of T cell responses. 

[00383] Early data from a phase VR investigator-sponsored trial, conducted at the 
University of Leiden (C.J.M. Melief), support the principle that the PADRE® molecule 
aKXV AAWTLKAAa, possibly the amidated aKXVAAWTLKAAa -NH 2 , is highly 
immunogenic in humans (Ressing et al, J. Immunother. 23(2):255-66 (2000)). In this 
trial, a PADRE® molecule was co-emulsified with various human papilloma virus 
(HPV)-derived CTL epitopes and was injected into patients with recurrent or residual 
cervical carcinoma. However, because of the late stage of carcinoma with the study 
patients, it was expected that these patients were immunocompromised. The patients' 
immunocompromised status was demonstrated by their low frequency of influenza virus- 
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specific CTL, reduced levels of CD3 expression, and low incidence of proliferative recall 
responses after in vitro stimulation with conventional antigens. Thus, no efficacy was 
anticipated in the University of Leiden trial, rather the goal of that trial was essentially to 
evaluate safety. Safety was, in fact, demonstrated. In addition to a favorable safety 
profile, PADRE® T cell reactivity was detected in four of 12 patients (Figure 2) in spite of 
the reduced immune competence of these patients. 
[00384] Thus, the PADRE® peptide component(s) of the vaccine bind with broad 
specificity to multiple allelic forms of HLA-DR molecules. Moreover, PADRE® peptide 
components) bind with high affinity (IC50 ^1000 nM), i.e., at a level of affinity correlated 
with being immunogenic for HLA Class II restricted T cells. The in vivo administration of 
PADRE® peptide(s) stimulates the proliferation of HTL in normal humans as well as 
patient populations. 

[00385] One or more PADRE® peptide(s) may be included in a composition, e.g., a 

vaccine, comprising one or more peptides, either as an individual peptide(s), fused to one 
or more variant peptides, or both. 

EXAMPLE 4. CTL RECOGNITION OF ENDOGENOUS PROCESSED ANTIGENS 

AFTER PRIMING 

[00386] This example determines that CTL induced by native or analoged peptide epitopes 
recognize endogenously synthesized, z.e., native antigens. 

[00387] Effector cells isolated from transgenic mice that are immunized with peptide 

epitopes are re-stimulated in vitro using peptide-coated stimulator cells. Six days later, 
effector cells are assayed for cytotoxicity and the cell lines that contain peptide-specific 
cytotoxic activity are further re-stimulated. An additional six days later, these cell lines 
are tested for cytotoxic activity on 51 Cr labeled Jurkat-A2.1/K b target cells in the absence 
or presence of peptide, and also tested on 51 Cr labeled target cells bearing the 
endogenously synthesized antigen, i.e. cells that are stably transfected with HTV 
expression vectors. 

[00388] The result will demonstrate that CTL lines obtained from animals primed with 

peptide epitope recognize endogenously synthesized HIV antigen. The choice of 
transgenic mouse model to be used for such an analysis depends upon the epitope(s) that is 
being evaluated. In addition to HLA-A*0201/K b transgenic mice, several other transgenic 
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mouse models including mice with human All, which may also be used to evaluate A3 
epitopes, and B7 alleles have been characterized and others (e.g., transgenic mice for 
HLA-A1 and A24) are being developed. HLA-DR1 and HLA-DR3 mouse models have 
also been developed, which may be used to evaluate HTL epitopes. 



EXAMPLE 5. ACTIVITY OF CTL-HTL CONJUGATED EPITOPES IN 

TRANSGENIC MICE 

[00389] This example illustrates the induction of CTLs and HTLs in transgenic mice by use 

of a HIV CTL/HTL peptide conjugate whereby the vaccine composition comprises 
peptides administered to an HIV-infected patient or an individual at risk for HIV. The 
peptide composition can comprise multiple CTL and/or HTL epitopes. This analysis 
demonstrates enhanced immunogenicity that can be achieved by inclusion of one or more 
HTL epitopes in a vaccine composition. Such a peptide composition can comprise an 
HTL epitope conjugated to a preferred CTL epitope containing, for example, at least one 
CTL epitope, or an analog of that epitope. The peptides may be lipidated, if desired. 

[00390] Immunization procedures: Immunization of transgenic mice is performed as 

described (Alexander et al., J. Immunol 159:4753-4761, 1997). For example, A2/K b 
mice, which are transgenic for the human HLA A2.1 allele and are useful for the 
assessment of the immunogenicity of HLA-A*0201 motif- or HLA-A2 supermotif-bearing 
epitopes, are primed subcutaneously (base of the tail) with a 0.1 ml of peptide in 
Incomplete Freund's Adjuvant, or if the peptide composition is a lipidated CTL/HTL 
conjugate, in DMSO/saline or if the peptide composition is a polypeptide, in PBS or 
Incomplete Freund's Adjuvant. Seven days after priming, splenocytes obtained from 
these animals are restimulated with syngenic irradiated LPS-activated lymphoblasts coated 
with peptide. 

[00391] Cell lines: Target cells for peptide-specific cytotoxicity assays are Jurkat cells 

transfected with the HLA-A2.1/K b chimeric gene (e.g., Vitiello et al, J. Exp. Med. 
173:1007, 1991). 

[00392] In vitro CTL activation: One week after priming, spleen cells (30x1 0 6 cells/flask) 
are co-cultured at 37°C with syngeneic, irradiated (3000 rads), peptide coated 
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lymphoblasts (lOxlO 6 cells/flask) in 10 ml of culture medium/T25 flask. After six days, 
effector cells are harvested and assayed for cytotoxic activity. 

[00393] Assay for cytotoxic activity: Target cells (1.0 to 1.5xl0 6 ) are incubated at 37°C in 
the presence of 200 jil of 51 Cr. After 60 minutes, cells are washed three times and 
resuspended in R10 medium. Peptide is added where required at a concentration of 1 
^g/ml. For the assay, 10 4 5l Cr-labeled target cells are added to different concentrations of 
effector cells (final volume of 200 \xl) in U-bottom 96-well plates. After a 6 hour 
incubation period at 37°C, a 0.1 ml aliquot of supernatant is removed from each well and 
radioactivity is determined in a Micromedic automatic gamma counter. The percent 
specific lysis is determined by the formula: percent specific release = 100 x (experimental 
release - spontaneous release)/(maximum release - spontaneous release). To facilitate 
comparison between separate CTL assays run under the same conditions, % 51 Cr release 
data is expressed as lytic units/10 6 cells. One lytic unit is arbitrarily defined as the number 
of effector cells required to achieve 30% lysis of 10,000 target cells in a 6 hour 51 Cr 
release assay. To obtain specific lytic units/10 6 , the lytic units/10 6 obtained in the absence 
of peptide is subtracted from the lytic units/10 6 obtained in the presence of peptide. For 
example, if 30% 51 Cr release is obtained at the effector (E): target (T) ratio of 50:1 (i.e., 
5xl0 5 effector cells for 10,000 targets) in the absence of peptide and 5:1 (i.e., 5xl0 4 
effector cells for 10,000 targets) in the presence of peptide, the specific lytic units would 
be: [(1/50,000X1/500,000)] x 10 6 = 18 LU. 

[00394] The results are analyzed to assess the magnitude of the CTL responses of animals 
injected with the immunogenic CTL/HTL conjugate vaccine preparation and are compared 
to the magnitude of the CTL response achieved using the CTL epitope as outlined in 
above. Analyses similar to this may be performed to evaluate the immunogenicity of 
peptide conjugates containing multiple CTL epitopes and/or multiple HTL epitopes. In 
accordance with these procedures it is found that a CTL response is induced, and 
concomitantly that an HTL response is induced upon administration of such compositions. 
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EXAMPLE 6. SELECTION OF CTL AND HTL EPITOPES FOR EXCLUSION IN AN 

HTV-SPECIFIC VACCESGB. 

[00395] This example illustrates the procedure for the selection of peptide epitopes for 
vaccine compositions of the invention. The peptides in the composition can be in the form 
of a nucleic acid sequence, either single or one or more sequences (i.e., minigene) that 
encodes peptide(s), or can be single and/or polyepitopic peptides. 

[00396] The following principles are utilized when selecting an array of epitopes for 
inclusion in a vaccine composition. Each of the following principles is balanced in order 
to make the selection. 

[00397] Epitopes are selected which, upon administration, mimic immune responses that 
correlate with virus clearance. For example, if it has been observed that patients who clear 
HTV generate an immune response to at least 3 epitopes on at least one HTV antigen, then 
3-4 epitopes should be included for HLA class I. A similar rationale is used to determine 
HLA class II epitopes. 

[00398] When selecting an array of fflV epitopes, it is preferred that at least some of the 
epitopes are derived from early and late proteins. The early proteins of HTV are expressed 
when the virus is replicating, either following acute or dormant infection. Therefore, it is 
particularly preferred to use epitopes from early stage proteins to alleviate disease 
manifestations at the earliest stage possible. 

[00399] Epitopes are often selected that have a binding affinity of an IC 50 of 500 nM or less 
for an HLA class I molecule, or for class II, an IC 50 of 1000 nM or less. 

[00400] Sufficient supermotif bearing peptides, or a sufficient array of allele-specific motif 
bearing peptides, are selected to give broad population coverage. For example, epitopes 
are selected to provide at least 80% population coverage. A Monte Carlo analysis, a 
statistical evaluation known in the art, can be employed to assess breadth, or redundancy, 
of population coverage. 

[00401] When creating a polyepitopic compositions, e.g. a minigene, it is typically 
desirable to generate the smallest peptide possible that encompasses the epitopes of 
interest. The principles employed are similar, if not the same, as those employed when 
selecting a peptide comprising nested epitopes. 

[00402] In cases where the sequences of multiple variants of the same target protein are 
available, potential peptide epitopes can also be selected on the basis of their conservancy. 
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For example, a criterion for conservancy may define that the entire sequence of an HLA 
class I binding peptide or the entire 9-mer core of a class H binding peptide be conserved 
in a designated percentage of the sequences evaluated for a specific protein antigen. 
[00403J Peptide epitopes for inclusion in vaccine compositions are, for example, selected 
from those listed in Tables 6-9 or Figures 1A-4. A vaccine composition comprised of 
selected peptides, when administered, is safe, efficacious, and elicits an immune response 
similar in magnitude of an immune response that clears an acute HIV infection. 



EXAMPLE 7. CONSTRUCTION OF MINIGENE MULTI-EPITOPE DNA PLASMIDS 

[00404] This example provides general guidance for the construction of a minigene 
expression plasmid. Minigene plasmids may, of course, contain various configurations of 
CTL and/or HTL epitopes or epitope analogs as described herein. Expression plasmids 
have been constructed and evaluated as described, for example, in co-pending U.S.S.N. 
09/311,784 filed 5/13/99 and in Ishioka et al, J. Immunol 162:3915-3925, 1999. An 
example of such a plasmid for the expression of HIV epitopes is shown in Figure 2, which 
illustrates the orientation of HIV peptide epitopes in a niinigene construct. 
[00405] A minigene expression plasmid typically includes multiple CTL and HTL peptide 
epitopes. In the present example, HLA-A2, -A3, -B7 supermotif-bearing peptide epitopes 
and HLA-A1 and -A24 motif-bearing peptide epitopes are used in conjunction with DR 
supermotif-bearing epitopes and/or DR3 epitopes (Figure 2). Preferred epitopes are 
identified, for example, in Tables 6-9 and Figures 1A-4. HLA class I supermotif or motif- 
bearing peptide epitopes derived from multiple HTV antigens, are selected such that 
multiple supermotifs/motifs are represented to ensure broad population coverage. 
Similarly, HLA class H epitopes are selected from multiple HTV antigens to provide broad 
population coverage, i.e. both HLA DR-1-4-7 supermotif-bearing epitopes and HLA DR-3 
motif-bearing epitopes are selected for inclusion in the minigene construct. The selected 
CTL and HTL epitopes are then incorporated into a minigene for expression in an 
expression vector. 

[00406] Such a construct may additionally include sequences that direct the HTL epitopes 
to the endoplasmic reticulum. For example, the Ii protein may be fused to one or more 
HTL epitopes as described in co-pending application U.S.S.N. 09/311,784 filed 5/13/99, 
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wherein the CLIP sequence of the E protein is removed and replaced with an HLA class H 
epitope sequence os that HLA class U epitope is directed to the endoplasmic reticulum, 
where the epitope binds to an HLA class H molecules. 

[004071 This example illustrates the methods to be used for construction of a minigene- 
bearing expression plasmid. Other expression vectors that may be used for minigene 
compositions are available and known to those of skill in the art. 

[00408] The minigene DNA plasmid contains a consensus Kozak sequence and a consensus 
murine kappa Ig-light chain signal sequence followed by CTL and/or HTL epitopes 
selected in accordance with principles disclosed herein. The construct can also include, 
for example, The sequence encodes an open reading frame fused to the Myc and His 
antibody epitope tag coded for by the pcDNA 3.1 Myc-His vector. 

[00409] Overlapping ohgonucleotides, for example eight oligonucleotides, averaging 

approximately 70 nucleotides in length with 15 nucleotide overlaps, are synthesized and 
HPLC-purified. The oligonucleotides encode the selected peptide epitopes as well as 
appropriate linker nucleotides, Kozak sequence, and signal sequence. The final 
multiepitope minigene is assembled by extending the overlapping ohgonucleotides in 
three sets of reactions using PCR. A Perkin/Elmer 9600 PCR machine is used and a total 
of 30 cycles are performed using the following conditions: 95°C for 15 sec, annealing 
temperature (5° below the lowest calculated Tm of each primer pair) for 30 sec, and 72°C 
for 1 min. 

[00410] For the first PCR reaction, 5 ug of each of two ohgonucleotides are annealed and 
extended: Ohgonucleotides 1+2, 3+4, 5+6, and 7+8 are combined in 100 ul reactions 
containing PJu polymerase buffer (lx= 10 mM KCL, 10 mM (NH^SO,, 20 mM Tris- 
chloride, P H 8.75, 2 mM MgS0 4 , 0.1% Triton X-100, 100 ug/ml BSA), 0.25 mM each 
dNTP, and 2.5 U of PJu polymerase. The full-length dimer products are gel-purified, and 
two reactions containing the product of 1+2 and 3+4, and the product of 5+6 and 7+8 are 
mixed, annealed, and extended for 10 cycles. Half of the two reactions are then mixed, 
and 5 cycles of annealing and extension carried out before flanking primers are added to 
amplify the full length product for 25 additional cycles. The full-length product is gel- 
purified and cloned into pCR-blunt (Livitrogen) and individual clones are screened by 
sequencing. 
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EXAMPLE 8. THE PLASMID CONSTRUCT AND THE DEGREE TO WHICH IT 

INDUCES IMMUNOGENICITY. 

[00411] The degree to which a plasmid construct, for example a plasmid constructed in 
accordance as above is able to induce immunogenicity can be evaluated in vitro by testing 
for epitope presentation by APC following transduction or transfection of the APC with an 
epitope-expressing nucleic acid construct. Such a study determines "antigenicity" and 
allows the use of human APC. The assay determines the ability of the epitope to be 
presented by the APC in a context that is recognized by a T cell by quantifying the density 
of epitope-HLA class I complexes on the cell surface. Quantitation can be performed by 
directly, measuring the amount of peptide eluted from the APC {see, e.g., Sijts et al, J. 
Immunol. 156:683-692, 1996; Demote et al, Nature 342:682-684, 1989); or the number of 
peptide-HLA class I complexes can be estimated by measuring the amount of lysis or 
lymphokine release induced by infected or transfected target cells, and then detennining 
the concentration of peptide necessary to obtained equivalent levels of lysis or lymphokine 
release {see, e.g., Kageyama et al, J. Immunol. 154:567-576, 1995). 

[00412] Alternatively, immunogenicity can be evaluated through in vivo injections into 
mice and subsequent in vitro assessment of CTL and HTL activity, which are analysed 
using cytotoxicity and proliferation assays, respectively, as detailed e.g., in copending 
U.S.S.N. 09/31 1,784 filed 5/13/99 and Alexander et al, Immunity 1 :751-761, 1994. 

[00413] For example, to assess the capacity of a DNA minigene construct {e.g., a pMin 
minigene construct generated as decribed in U.S.S.N. 09/311,784) containing at least one 
HLA-A2 supermotif peptide to induce CTLs in vivo, HLA-A2.1/K b transgenic mice, for 
example, are immuniz ed intramuscularly with 100 ug of naked cDNA. As a means of 
comparing the level of CTLs induced by cDNA immunization, a control group of animals 
is also immunized with an actual peptide composition that comprises multiple epitopes 
synthesized as a single polypeptide as they would be encoded by the minigene. 

[00414] Splenocytes from immunized animals are stimulated twice with each of the 
respective compositions (peptide epitopes encoded in the minigene or the polyepitopic 
peptide), then assayed for peptide-specific cytotoxic activity in a 51 Cr release assay. The 
results indicate the magnitude of the CTL response directed against the A2-restricted 
epitope, thus indicating the in vivo immunogenicity of the minigene vaccine and 
polyepitopic vaccine. It is, therefore, found that the minigene elicits immune responses 
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directed toward the HLA-A2 supermotif peptide epitopes as does the polyepitopic peptide 
vaccine. A similar analysis is also performed using other HLA-A3 and HLA-B7 
transgenic mouse models to assess CTL induction by HLA-A3 and HLA-B7 motif or 
supermotif epitopes. 

[00415] To assess the capacity of a class II epitope encoding minigene to induce HTLs in 
vivo, DR transgenic mice, or for those epitope that cross react with the appropriate mouse 
MHC molecule, I-A b -restricted mice, for example, are immunized intramuscularly with 
100 ug of plasmid DNA. As a means of comparing the level of HTLs induced by DNA 
immunization, a group of control animals is also immunized with an actual peptide 
composition emulsified in complete Freund's adjuvant. CD4+ T cells, i.e. HTLs, are 
purified from splenocytes of immunized animals and stimulated with each of the 
respective compositions (peptides encoded in the minigene). The HTL response is 
measured using a 3 H-thymidine incorporation proliferation assay, (see, e.g., Alexander et 
al. Immunity 1:751-761, 1994). The results indicate the magnitude of the HTL response, 
thus demonstrating the in vivo immunogenicity of the minigene. 

[00416] DNA minigenes, constructed as described above or below, may also be evaluated 
as a vaccine in combination with a boosting agent using a prime boost protocol. The 
boosting agent can consist of recombinant protein {e.g., Bamett et al, Aids Res. and 
Human Retroviruses 14, Supplement 3:S299-S309, 1998) or recombinant vaccinia, for 
example, expressing a minigene or DNA encoding the complete protein of interest (see, 
e.g., Hanke et al, Vaccine 16:439-445, 1998; Sedegah et al, Proc. Natl Acad. Sci USA 
95:7648-53, 1998; Hanke and McMichael, Immunol Letters 66:177-181, 1999; and 
Robinson et al, Nature Med. 5:526-34, 1999). 

[00417] For example, the efficacy of the DNA minigene used in a prime boost protocol is 
initially evaluated in transgenic mice. In this example, A2.1/K b transgenic mice are 
immunized IM with 100 ug of a DNA mmigene encoding the immunogenic peptides 
including at least one HLA-A2 supermotif-bearing peptide. After an incubation period 
(ranging from 3-9 weeks), the mice are boosted IP with 10 7 pfu/mouse of a recombinant 
vaccinia virus expressing the same sequence encoded by the DNA minigene. Control 
mice are immunized with 100 ug of DNA or recombinant vaccinia without the minigene 
sequence, or with DNA encoding the minigene, but without the vaccinia boost. After an 
additional incubation period of two weeks, splenocytes from the mice are immediately 
assayed for peptide-specific activity in an ELISPOT assay. Additionally, splenocytes are 
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stimulated in vitro with the A2-restricted peptide epitopes encoded in the mini gene and 
recombinant vaccinia, then assayed for peptide-specific activity in an IFN-y ELK A. 

[00418] It is found that the minigene utilized in a prime-boost protocol elicits greater 
immune responses toward the HLA-A2 supermotif peptides than with DNA alone. Such 
an analysis can also be performed using HLA-A11 or HLA-B7 transgenic mouse models 
to assess GTL induction by HLA-A3 or HLA-B7 motif or supermotif epitopes. 

[00419] The use of prime boost protocols in humans is described in below. 



EXAMPLE 9. PEPTIDE COMPOSITION FOR PROPHYLACTIC USES 

[00420] Vaccine compositions of the present invention can be used to prevent HIV 
infection in persons who are at risk for such infection. For example, a polyepitopic 
peptide epitope composition (or a nucleic acid comprising the same) containing multiple 
CTL and HTL epitopes, which are also selected to target greater than 80% of the 
population, is administered to individuals at risk for HTV infection. 

[00421] For example, a peptide-based composition can be provided as a single polypeptide 
that encompasses multiple epitopes. The vaccine is typically administered in a 
physiological solution that comprises an adjuvant, such as Incomplete Freunds Adjuvant. 
The dose of peptide for the initial immunization is from about 1 to about 50,000 ug, 
generally 100-5,000 ug, for a 70 kg patient. The initial administration of vaccine is 
followed by booster dosages at 4 weeks followed by evaluation of the magnitude of the 
immune response in the patient, by techniques that determine the presence of epitope- 
specific CTL populations in a PBMC sample. Additional booster doses are administered 
as required. The composition is found to be both safe and efficacious as a prophylaxis 
against HTV infection. 

[00422] Alternatively, a composition typically comprising transfecting agents can be used 
for the acmunistration of a nucleic acid-based vaccine in accordance with methodologies 
known in the art and disclosed herein. 
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EXAMPLE 10. POLYEPITOPIC VACCINE COMPOSITIONS DERIVED FROM 

NATIVE HIV SEQUENCES 

[00423] A native HTV polyprotein sequence is screened, preferably using computer 
algorithms defined for each class I and/or class H supermotif or motif, to identify 
"relatively short" regions of the polyprotein that comprise multiple epitopes and is 
preferably less in length than an entire native antigen. This relatively short sequence that 
contains multiple distinct, even overlapping, epitopes is selected and used to generate a 
minigene construct. The construct is engineered to express the peptide, which corresponds 
to the native protein sequence. The "relatively short" peptide is generally less than 250 
amino acids in length, often less than 100 amino acids in length, preferably less than 75 
amino acids in length, and more preferably less than 50 amino acids in length. The protein 
sequence of the vaccine composition is selected because it has maximal number of 
epitopes contained within the sequence, i.e., it has a high concentration of epitopes. As 
noted herein, epitope motifs may be nested or overlapping, for example, two 9-mer 
epitopes and one 10-mer epitope can be present in a 10 amino acid peptide. Such a 
vaccine composition is administered for therapeutic or prophylactic purposes. 
[00424] The vaccine composition will preferably include, for example, three CTL epitopes 
and at least one HTL epitope from HTV. This polyepitopic native sequence is 
administered either as a peptide or as a nucleic acid sequence which encodes the peptide. 
Alternatively, an analog can be made of this native sequence, whereby one or more of the 
epitopes comprise substitutions that alter the cross-reactivity and/or binding affinity 
properties of the polyepitopic peptide. 
[00425] The embodiment of this example provides for the possibility that an as yet 
undiscovered aspect of immune system processing will apply to the native nested 
sequence and thereby facilitate the production of therapeutic or prophylactic immune 
response-inducing vaccine compositions. Additionally such an embodiment provides for 
the possibility of motif-bearing epitopes for an HLA makeup that is presently unknown. 
Furthermore, this embodiment (absent analogs) directs the immune response to multiple 
peptide sequences that are actually present in native HTV antigens thus avoiding the need 
to evaluate any junctional epitopes. Lastly, the embodiment provides an economy of scale 
when producing nucleic acid vaccine compositions. 
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[00426] Related to this embodiment, computer programs can be derived in accordance with 
principles in the art, which identify in a target sequence, the greatest number of epitopes 
per sequence length. 

EXAMPLE 11. POLYEPnOPIC VACCINE COMPOSITIONS DIRECTED TO 

MULTIPLE DISEASES 

[00427] The HTV peptide epitopes of the present invention are used in conjunction with 

peptide epitopes from target antigens related to one or more other diseases, to create a 
vaccine composition that is useful for the prevention or treatment of HTV as well as the 
one or more other disease(s). Examples of the other diseases include, but are not limited 
to, HCVandHBV. 

[00428] For example, a polyepitopic peptide composition comprising multiple CTL and 

HTL epitopes that target greater than 98% of the population may be created for 
administration to individuals at risk for both HBV and HIV infection. The composition 
can be provided as a single polypeptide that incorporates the multiple epitopes from the 
various disease-associated sources, or can be administered as a composition comprising 
one or more discrete epitopes. 

EXAMPLE 12. USE OF PEPTIDES TO EVALUATE AN IMMUNE RESPONSE 

[00429] Peptides of the invention may be used to analyze an immune response for the 
presence of specific CTL or HTL populations directed to HIV. Such an analysis may be 
performed in a manner as that described by Ogg et al, Science 279:2103-2106, 1998. In 
the following example, peptides in accordance with the invention are used as a reagent for 
diagnostic or prognostic purposes, not as an immunogen. 

[00430] In this example highly sensitive human leukocyte antigen tetrameric complexes 

("tetramers") are used for a cross-sectional analysis of, for example, HTV HLA-A*0201- 
specific CTL frequencies from HLA A*0201 -positive individuals at different stages of 
infection or following immunization using an HIV peptide containing an A*0201 motif. 
Tetrameric complexes are synthesized as described (Musey et al, N. Engl J. Med. 
337:1267, 1997). Briefly, purified HLA heavy chain (A*0201 in this example) and 02- 
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microglobulin are synthesized by means of a prokaryotic expression system. The heavy 
chain is modified by deletion of the transmembrane-cytosolic tail and COOH-terminal 
addition of a sequence containing a BirA enzymatic biotinylation site. The heavy chain, 
P2-microglobulin, and peptide are refolded by dilution. The 45-kD refolded product is 
isolated by fast protein liquid chromatography and then biotinylated by BirA in the 
presence of biotin (Sigma, St. Louis, Missouri), adenosine 5 'triphosphate and magnesium. 
Streptavidm-phycoerythrin conjugate is added in a 1:4 molar ratio, and the tetrameric 
product is concentrated to 1 mg/ml. The resulting product is referred to as tetramer- 
phycoerythrin. 

[00431] For the analysis of patient blood samples, approximately one million PBMCs are 
centrifuged at 300 x g for 5 minutes and resuspended in 50 ul of cold phosphate-buffered 
saline. Tri-color analysis is performed with the tetramer-phycoerythrin, along with anti- 
CD8-Tricolor, and anti-CD38. The PBMCs are incubated with tetramer and antibodies on 
ice for 30 to 60 min and then washed twice before formaldehyde fixation. Gates are 
applied to contain >99.98% of control samples. Controls for the tetramers include both 
A*0201-negative individuals and A*0201 -positive uninfected donors. The percentage of 
cells stained with the tetramer is then determined by flow cytometry. The results indicate 
the number of cells in the PBMC sample that contain epitope-restricted CTLs, thereby 
readily indicating the extent of immune response to the HIV epitope, and thus the stage of 
infection with HTV, the status of exposure to HTV, or exposure to a vaccine that elicits a 
protective or therapeutic response. 



EXAMPLE 13. USE OF PEPTIDE EPITOPES TO EVALUATE RECALL RESPONSES 

[00432] The peptide epitopes of the invention are used as reagents to evaluate T cell 
responses, such as acute or recall responses, in patients. Such an analysis may be 
performed on patients who have recovered from infection, who are chronically infected 
with HTV, or who have been vaccinated with an HTV vaccine. 

[00433] For example, the class I restricted CTL response of persons who have been 
vaccinated may be analyzed. The vaccine may be any HTV vaccine. PBMC are collected 
from vaccinated individuals and HLA typed. Appropriate peptide epitopes of the 
invention that, optimally, bear supermotifs to provide cross-reactivity with multiple HLA 
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supertype family members, are then used for analysis of samples derived from individuals 
who bear that HLA type. 

[00434] PBMC from vaccinated individuals are separated on Ficoll-Histopaque density 
gradients (Sigma Chemical Co., St. Louis, MO), washed three times in HBSS (GIBCO 
Laboratories), resuspended in RPMI-1640 (GIBCO Laboratories) supplemented with L- 
glutamine (2mM), penicillin (50U/ml), streptomycin (50 ug/ml), and Hepes (lOmM) 
containing 10% heat-inactivated human AB serum (complete RPMI) and plated using 
microculture formats. A synthetic peptide comprising an epitope of the invention is added 
at 10 ug/ml to each well and HBV core 128-140 epitope is added at 1 ug/ml to each well 
as a source of T cell help during the first week of stimulation. 

[00435] In the microculture format, 4 x 10 5 PBMC are stimulated with peptide in 8 
replicate cultures in 96-well round bottom plate in 100 ul/well of complete RPMI. On 
days 3 and 10, 100 ml of complete RPMI and 20 U/ml final concentration of rIL-2 are 
added to each well. On day 7 the cultures are transferred into a 96-well flat-bottom plate 
and restimulated with peptide, rIL-2 and 10 5 irradiated (3,000 rad) autologous feeder cells. 
The cultures are tested for cytotoxic activity on day 14. A positive CTL response requires 
two or more of the eight replicate cultures to display greater than 10% specific 51 Cr 
release, based on comparison with uninfected control subjects as previously described 
(Rehermann, et al, Nature Med. 2:1104,1108, 1996; Rehermann et al, J. Clin. Invest. 
97:1655-1665, 1996; and Rehermann et al. J. Clin. Invest. 98:1432-1440, 1996). 

[00436] Target cell lines are autologous and allogeneic EBV-transformed B-LCL that are 
either purchased from the American Society for ffistocompatibility and Immunogenetics 
(ASffl, Boston, MA) or established from the pool of patients as described (Guilhot, et al. 
J. Virol. 66:2670-2678, 1992). 

[00437] Cytotoxicity assays are performed in the following manner. Target cells consist of 
either allogeneic HLA-matched or autologous EBV-transformed B lymphoblastoid cell 
line that are incubated overnight with the synthetic peptide epitope of the invention at 10 
uM, and labeled with 100 uCi of 51 Cr (Amersham Corp., Arlington Heights, IL) for 1 hour 
after which they are washed four times with HBSS. 

[00438] Cytolytic activity is determined in a standard 4-h, split well 5, Cr release assay 
using U-bottomed 96 well plates containing 3,000 targets/well. Stimulated PBMC are 
tested at effector/target (E/T) ratios of 20-50:1 on day 14. Percent cytotoxicity is 
determined from the formula: 100 x [(experimental release-spontaneous 
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release)/maximum release-spontaneous release)]. Maximum release is determined by lysis 
of targets by detergent (2% Triton X-100; Sigma Chemical Co., St. Louis, MO). 
Spontaneous release is <25% of maximum release for all experiments. 

[00439] The results of such an analysis indicate the extent to which HLA-restricted CTL 

populations have been stimulated by previous exposure to HIV or an HIV vaccine. 

[00440] The class II restricted HTL responses may also be analyzed. Purified PBMC are 

cultured in a 96-well flat bottom plate at a density of 1.5x1 0 5 cells/well and are stimulated 
with 10 p,g/ml synthetic peptide, whole antigen, or PHA. Cells are routinely plated in 
replicates of 4-6 wells for each condition. After seven days of culture, the medium is 
removed and replaced with fresh medium containing lOU/ml IL-2. Two days later, 1 pCi 
3 H-thymidine is added to each well and incubation is continued for an additional 18 hours. 
Cellular DNA is then harvested on glass fiber mats and analyzed for 3 H-thymidine 
incorporation. Antigen-specific T cell proliferation is calculated as the ratio of 3 H- 
thymidine incorporation in the presence of antigen divided by the 3 H-thymidine 
incorporation in the absence of antigen. 



EXAMPLE 14. INDUCTION OF SPECIFIC CTL RESPONSE IN HUMANS 

[00441] A human clinical trial for an immunogenic composition comprising CTL and HTL 

epitopes of the invention is set up as an IND Phase I, dose escalation study and carried out 
as a randomized, double-blind, placebo-controlled trial. Such a trial is designed, for 
example, as follows: 

[00442] A total of about 27 subjects are enrolled and divided into 3 groups: 

Group I: 3 subjects are injected with placebo and 6 subjects are injected with 5 p.g of 
peptide composition; 

Group II: 3 subjects are injected with placebo and 6 subjects are injected with 50 p.g 
peptide composition; 

Group IE: 3 subjects are injected with placebo and 6 subjects are injected with 500 jug of 
peptide composition. 

[00443] After 4 weeks following the first injection, all subjects receive a booster 
inoculation at the same dosage. 
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[00444] The endpoints measured in this study relate to the safety and tolerability of the 

peptide composition as well as its immunogenicity. Cellular immu ne responses to the 
peptide composition are an index of the intrinsic activity of this the peptide composition, 
and can therefore be viewed as a measure of biological efficacy. The following 
summarize the clinical and laboratory data that relate to safety and efficacy endpoints. 

[00445] Safety: The incidence of adverse events is monitored in the placebo and drug 

treatment group and assessed in terms of degree and reversibility. 

[00446] Evaluation of Vaccine Efficacy: For evaluation of vaccine efficacy, subjects are 

bled before and after injection. Peripheral blood mononuclear cells are isolated from fresh 
heparinized blood by Ficoll-Hypaque density gradient centrifugation, aliquoted in freezing 
media and stored frozen. Samples are assayed for CTL and HTL activity. 

[00447] The vaccine is found to be both safe and efficacious. 



EXAMPLE 15. PHASE H TRIALS IN PATIENTS INFECTED WITH HIV 

[00448] Phase II trials are performed to study the effect of administering the CTL-HTL 

peptide compositions to HIV-infected patients. The main objectives of the trials are to 
determine an effective dose and regimen for inducing CTLs in chronically infected HIV 
patients, to establish the safety of inducing a CTL and HTL response in these patients, and 
to see to what extent activation of CTLs improves the clinical picture of chronically 
infected HTV patients, as manifested by a reduction in viral load and an increase in CD4 + 
cells counts. Such a study is designed, for example, as follows: 

[00449] The studies are performed in multiple centers. The trial design is an open-label, 

uncontrolled, dose escalation protocol wherein the peptide composition is administered as 
a single dose followed six weeks later by a single booster shot of the same dose. The 
dosages are 50, 500 and 5,000 micrograms per injection. Drug-associated adverse effects 
(severity and reversibility) are recorded. 

[00450] There. are three patient groupings. The first group is injected with. 50 micrograms 
of the peptide composition and the second and third groups with 500 and 5,000 
micrograms of peptide composition, respectively. The patients within each group range in 
age from 21-65, include both males and females, and represent diverse ethnic 
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backgrounds. All of them are infected with HIV for over five years and are HCV, HBV 
and delta hepatitis virus (HDV) negative, but have positive levels of HTV antigen. 
[00451 J The viral load and CD4 + levels are monitored to assess the effects of adniinistering 
the peptide compositions. The vaccine composition is found to be both safe and 
efficacious in the treatment of HIV infection. 



EXAMPLE 16. INDUCTION OF CTL RESPONSES USING A PRIME BOOST 

PROTOCOL 

[00452] A prime boost protocol can also be used for the acmfinistration of the vaccine to 
humans. Such a vaccine regimen can include an initial administration of, for example, 
naked DNA followed by a boost using recombinant virus encoding the vaccine, or 
recombinant protein/polypeptide or a peptide mixture administered in an adjuvant. 

[00453] For example, the initial immunization is performed using an expression vector, 
such as that constructed above, in the form of naked nucleic acid administered IM (or SC 
or ID) in the amounts of 0.5-5 mg at multiple sites. The nucleic acid (0.1 to 1000 ug) can 
also be administered using a gene gun. Following an incubation period of 3-4 weeks, a 
booster dose is then administered. The booster is, for example, recombinant fowlpox virus 
a&ninistered at a dose of 5-10 7 to 5xl0 9 pfu. An alternative recombinant virus, such as an 
MVA, canarypox, adenovirus, or adeno-associated virus, can also be used for the booster, 
or the polyepitopic protein or a mixture of the peptides can be administered. For 
evaluation of vaccine efficacy, patient blood samples are obtained before immunization as 
well as at intervals following administration of the initial vaccine and booster doses of the 
vaccine. Peripheral blood mononuclear cells are isolated from fresh heparinized blood by 
Ficoll-Hypaque density gradient centrifugation, aliquoted in freezing media and stored 
frozen. Samples are assayed for CTL and HTL activity. 

[00454] Analysis of the results indicates that a magnitude of sufficient response to achieve 
protective immunity against HTV is generated. 
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EXAMPLE 17. ADMINISTRATION OF VACCINE COMPOSITIONS USING 

DENDRITIC CELLS 

[004S5J Vaccines comprising peptide epitopes of the invention can be administered using 
APCs, or "professional" APCs such as DC. In this example, the peptide-pulsed DC are 
administered to a patient to stimulate a CTL response in vivo. In this method, dendritic 
cells are isolated, expanded, and pulsed with a vaccine comprising peptide CTL and HTL 
epitopes of the invention. The dendritic cells are infused back into the patient to elicit 
CTL and HTL responses in vivo. The induced CTL and HTL then destroy or facilitate 
destruction of the specific target cells that bear the proteins from which the epitopes in the 
vaccine are derived. 

[00456] For example, a cocktail of epitope-bearing peptides is administered ex vivo to 
PBMC, or isolated DC therefrom. A pharmaceutical to facilitate harvesting of DC can be 
used,, such as Progenipoietin™ (Monsanto, St. Louis, MO) or GM-CSF/IL-4. After 
pulsing the DC with peptides and prior to reinfusion into patients, the DC are washed to 
remove unbound peptides. 

[00457] As appreciated clinically, and readily determined by one of skill based on clinical 
outcomes, the number of DC reinfused into the patient can vary (see, e.g., Nature Med. 
4:328, 1998; Nature Med. 2:52, 1996 and Prostate 32:272, 1997). Although 2-50 x 10 6 
DC per patient are typically administered, larger number of DC, such as 10 7 or 10 8 can 
also be provided. Such cell populations typically contain between 50-90% DC. 

[00458] In some embodiments, peptide-loaded PBMC are injected into patients without 
purification of the DC. For example, PBMC containing DC generated after treatment with 
an agent such as Progenipoietin™ are injected into patients without purification of the DC. 
The total number of PBMC that are administered often ranges from 10 8 to 10 10 . 
Generally, the cell doses injected into patients is based on the percentage of DC in the 
blood of each patient, as determined, for example, by immunofluorescence analysis with 
specific anti-DC antibodies. Thus, for example, if Progenipoietin™ mobilizes 2% DC in 
the peripheral blood of a given patient, and that patient is to receive 5 x 10 6 DC, then the 
patient will be injected with a total of 2.5 x 10 8 peptide-loaded PBMC. The percent DC 
mobilized by an agent such as Progenipoietin™ is typically estimated to be between 2- 
10%, but can vary as appreciated by one of skill in the art. 
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Ex vivo activation of CTL/HTL responses 

[00459] Alternatively, ex vivo CTL or HTL responses to HIV antigens can be induced by 
incubating in tissue culture the patient's, or genetically compatible, CTL or HTL precursor 
cells together with a source of APC, such as DC, and the appropriate immunogenic 
peptides. After an appropriate incubation time (typically about 7-28 days), in which the 
precursor cells are activated and expanded into effector cells, the cells are infused back 
into the patient, where they will destroy or facilitate destruction of their specific target 
cells. 

[00460] It is understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview 
of this application and scope of the appended claims. All publications, patents, patent 
applications and sequence listings cited herein are hereby incorporated by reference in 
their entirety for all purposes. 
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TABLE 1 



SUPERMOTIFS 


POSITION 


POSITION 


POSITION 




2 (Primary 
Anchor) 


3 (Primary 
Anchor) 


C Terminus fPrimarv 
Anchor) 


Al 


T,I,Z, V, M, S 




F, Y 


A2 


L,I, \,M,A, T, 
Q 




I, V, M.ATL 


A3 


V, S, M, A, T, L, 
I 




R,K 


A24 


Y, F, W, I V, L, 
M r T 




F, I, r, W.L.M 


B7 


P 




V.LL.F.M JT, r, A 

7 7 "7 7 J ' ' i t ■* -*- 


B27 


R, H, K 




F,Y,L,^M.Z 


B44 


E,Z> 




F, W, L, I, M, V. A 


B58 


A,T,S 




F,W,Y,Z,Z F, Af, A 


B62 


Q,L,Z V, M, P 




F, W, Y, M I V L A I 










MOTIFS 








Al 


T,S,M 




Y 


Al 




D, E,A, S 


Y 


A2.1 


L, M, V, O, L A, 

r 




V. Z, IMA T 


A3 


L, M, V, I, S, A, 

T, F, C, G, D 




K.Y R /7 F 

*^7 X 7 ^7 ^ 


All 


V, T, M, L, I, S, 

7 7 7 7 7 

A, G, N, C, A F 




K«R Y H 


A24 


Y,F,W,M 




F. L, L W 


A*3101 


M,V,T„4,Z,Z 






A*3301 


M,V,A,L,F,Z 
S T 




R, K 


A*6801 


A,V,T,M,S,L, 
I 




R, K 


B*0702 


P 




l»m,f, y.iiz r 


B*3501 


P 




L,M,F,W,Y,Z F, y4 


B51 


P 




UhV,F,W,Y.A.M 


B*5301 


P 




I,M,F,W,Y,4Z, F 


B*5401 


P 




A,T,I,V,Z,M,F, ^ 
Y 



Bolded residues are preferred, italicized residues are tolerated: A peptide is considered 
motif-bearing if it has primary anchors at each primary anchor position for a motif or 
supermotif as specified in the above table. 
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TABLE 2 



SUPERMOTIFS 


POSITION 


POSITION 


POSTTTOM 

X WOX X JLV/JLN 




2 (Primary 
Anchor) 


3 rPrirnarv 

t *■ .1111 LAX y 

Anchor) 


TfMTVllTin C fPrimo-n/ 
V X ClllllUUa 1 a 1 JXLl al y 

Anchor^ 


Al 


T,I,Z, V,M,S 




F W Y' 


A2 


v, a ^, r 




T V 7 A/f 4 T 


A3 


V, S, M, a, r, z, 

/ 




R It 


A24 


Y, F, W, I, V, L, 
M, T 




FT Y W T M \ 


B7 


P 




V LL P M W Y A 


B27 


R,H,K 




F, Y, L, Af, /, V, A 


B58 


A,T,S 




FWY / T V M A 


B62 


Q,L,7, V,M,P 




F,W,Y,M,7, KZ.,4 










MOTIFS 








Al 


T,S,M 




Y 


Al 




D, E,A,S 


Y 


A2.1 


V,Q,A,T H 




V,L,I,M,A, T 


A3.2 


L,M,V,I,S,A, 
T, F, C, G, D 






All 


V,T,M,L,I,S, 
A, G, N, C, D, F 




K,*,i£ Y 


A24 


Y,F,W 




F, L, I, W 



*If 2 is V, or Q, the C-term is not L 



Bolded residues are preferred, italicized residues are tolerated: A peptide is considered 
motif-bearing if it has primary anchors at each primary anchor position for a motif or 
supennotif as specified in the above table. 
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Table 6. Recognition of variant peptides by CTL generated after one and five 
stimulations with the parent peptide. 



A. Gag 386 (VLAEAMSQV) 



Peptide Sequence 


Binding 

IC50(nM) 


i 1 Stimulation 5 Stimulations 

(SU) (SU) 


VLAEAMSQV 

VLAEAMSQA 

VLAEAMSQI 

VLAEAMSKV 

VLAEAMSKA 

ILAEAMSQA 

ILAEAMSKA 

VLAEAMAAA 


49.9 
23.8 
70.9 
230.5 
69.4 
29.3 
72.4 
17 


31.6 222.0 
17.0 133.5 

21.2 246.1 
10.8 130.9 
NT 36.6 
4.0 49.7 

16.3 90.3 



B. Env 134 (KLTPLCVTL) 



KLTPLCVTL 


77.0 


KITPLCVTL 


461 


QLTPLCVTL 


63.6 


QITPLCVTL 


975 


ELTPLCVTL 


7190 


KLTPFCVTL 


87.3 


KLTPLCVIL 


356 


KLTPLCVPL 


14.6 



278.4 
231.8 
166.2 
105.0 
91.7 
36.1 
77.2 
9.6 



683.6 

700.8 

361.5 

166.9 

100.0 

75.4 

29.1 

14.8 
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Table 7. Conservation of EP HIV-1090 epitopes across clades, calculated as identity 
or immunological conservation 



Total Clade B Clade C 

Sequence Identity Imm. Cons. Identity Imm. Cons. Identity Imm. Cons. 



Pol 498 ILKEPVHGV 
Gag 386 VLAEAMSQV 

Pol 448 KLVGKLNWA 
Env 134 KLTPLCVTL 

Vpr 62 RILQQLLFI 
Nef221 LTFGWCFKL 
Gag 271 MTNNPPIPV 


62% 87% 
32% 93% 
95% 96% 
80% 93% 
51% 93% 
49% 74% 
20% 25% 


77% 86% 
68% 91% 
95% 95% 
90% 95% 
68% 91% 

770/. Q4 0/ 

t / /o y i vo 
91% 95% 


74% 95% 
5% 94% 
95% 98% 
89% 98% 
61% 95% 
47% 81% 
8% 19% 


Env 47 VTVYYGVPVWK 
Pol 929 QMAVFIHNFK 
Pol 98 VTIKIGGQLK 
Pol 971 KIQNFRVYYR 
Pol 347 AIFQSSMTK 
Pol 722 KVYLAWVPAHK 
Env 61 TTLFCASDAK 


59% 87% 
84% 98% 
11% 71% 
80% 86% 
53% 75% 
14% 97% 
72% 89% 


95% 100% 
100% 100% 
59% 91% 
91% 95% 
77% 82% 
82% - 95% 
90% 100% 


61% 92% 
94% 97% 

2% 89% 
79% 89% 
44% 79% 

3% 97% 
69% 92% 


Nef94 FPVRPQVPL 
Gag 545 YPLASLRSLF 

Rev 75 VPLQLPPL 
Env 259 IPIHYCAPA 
Gag 237 HPVHAGPIA 
Pol 893 IPYNPQSQGW 
Env 250 CPKVSFEPI 


81% 93% 
7% 29% 
44% 78% 
74% 95% 
27% 54% 
92% 96% 
45% 91% 


77% 95% 
45% 95% 
68% 77% 
45% 95% 
68% 95% 
82% 95% 
77% 100% 


82% 94% 
0% 0% 
27% 79% 
79% 97% 
44% 94% 
240% 97% 
45% 97% 


Mean 54% 81% 77% 93% 
n= 167 22 


59% 84% 
62 
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Table 8. H 


IV Peptides predicted to be immunologically conserved 








1 


1 1 1 1 








Sequence 
Distribution 










Subtype 


Protein 




tons ervect 


All 


A 


B 


C 


D 


G 


Pol 498 


ILKEPVHGV 


xuivur v xx V7 v 




1 


17 


46 


2 


2 






J- JuxCCi ir v rlij v 


12 






5 




1 






X liivCi f v rlvjA 


10 






2 


1 








X Jj i\_L/ ir V riOr V 


o 
a 


5 














VT VT7D"t7"LJ/""«T7 

iSJjxvci iy Vrivj V 


J 
















X Jj ivUX-' VJaCxfi 




2 














X XjJvW P V rlCj V 
































Gag 386 


VLARAMSOV 


VXi A "R A MfiOA 


o / 


z. 


1 


36 


3 


3 






VT.3\ 2VMQ r^TT 




7 


15 


3 


1 








\7T . Z\ TT Z\ M O HT 
V Xi/xCi/iJ vi o y X 


XX 






9 










\7T . A TT" 7\ M CXJ A 

V XxHJ2i/\1Y1 oh/i 


r~ 
D 






4 










X X4AJiili v i o y V 


r- 

5 




3 












X Xxrt±Ltti v i o y/i, 


O 






2 










































Pol 448 


KLVGKXiInTWA 




1 CO 
Xt> O 


9 


21 


59 


3 


3 






J\XjXVjx\XjI\1 WA 


1 






























Env 134 


XVXJ J. JCr J_> V_ V _L I_J 


TTT TUT /'it jipt 
XV Xi 1 xtXjL. V X Xj 


X.34 


8 


19 


55 










QXjTPLjCVTL 


5 


2 


1 












KLjTPLCVAL 


6 
















KXi 1 PXjCVTIj 








3 










KX xPLCVTL 
































Vpr 62 


RILOOKLFT 




86 


1 


15 


28 


4 


3 






KXXiyyijXjJ? V 


21 


2 




2 










RTLQQLLFI 


10 




2 


4 










RTLQQLLFV 


10 






*i 
•i. 










RILQQLLFT 


6 






2 










RMLQQLLFI 


4 




1 


3 










RVLQQLLFI 


3 






3 
























Nef 221 


LTFGWCFKL 


LTFGWCFKL 


82 


8 


17 


29 




3 






LTFGWCYKL 


31 


1 


2 


17 










LTLGWCFKL 


4 






1 
























Gag 271 MTNNPPIPV 


MTSNPPIPV | 


60 


3 




24 


4 


1 
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IY1TJUNPPIPV 


33 




20 


5 










JY1TSNPPVPV 


26 


1 




15 




1 






A/I , TY" T M"Q T TIT T 

l v i 1 va JNI Jr X F V 


15 


5 




1 










M r TV , 'NT"D TOT 7"DT 7 

l v l X vrlNI xr xr V r* V 


9 






5 










i v l 1 JM N hr tr V P V 


6 






6 










M J. ANP PVPV 


3 






2 












* 












Env 4 7 


V X V x Xvjv Jtr V WJ\. 


V X V IX GV P VWK 


99 


6 


21 


30 


3 








V TV YYGVPVWR 


40 


1 




18 










VT I YYGVPVWK 


2 






























Pol 929 


DM A VT? T TTNT T? V 


yWAV if X xi Wr K 


153 


10 


22 


58 


4 


3 






UMAVFVHNF K 


3 






1 










qmavfvhnyk 


2 
































V X XJXXVaijyijlN. 


VSIKVGGQIK 


30 






30 










VTIKIG6QLK 


18 




13 


1 










VTVKIGGQLK 


11 


1 


1 




1 








VTVRIGGQLK 


6 


3 














VSIKVGGQIR 


6 






6 










VSIRVGGQIK 


4 






4 








* 


VTIRIGGQLK 


3 




2 












V 1 V KI GGQLR 


3 


1 














VTVKVGGQLK 


3 






























Pol Q71 


IS.J- VlMr JK.V X IK 


tTTnxTT3nTrinm 

KIQNFRVYYR 


133 


6 


20 


49 


4 


3 




















rOl / 


AxfcQbSIVITK 


AIFQSSMTK 


88 


5 


17 


27 


3 


2 






AIFQCSMTK 


19 




2 


5 










AIFQSSMTR 


13 




1 


11 




1 






AIFQASMTK 


9 


1 




1 










SIFQSSMTK 


9 


3 




6 










AIFQYSMTK 


4 
















AIFQSTMTK 


2 






1 
























Pol 79 9 


IS. V X Xxtt.V\J V irAllJN. 


IvVxXjSWVPAHK 


56 


8 




12 


1 


3 






DT7VT TVTT m n tttj* 

K V x Xj SWVPAHK 


55 






41 










KVYLAWVPAHK 


23 


1 


18 




3 








KVYLTWVPAHK 


5 




2 












KIYL SWVPAHK 


5 






3 










RIYLSWVPAHK 


5 






4 










KIYLAWVPAHK 


2 




1 












QVYLTWPAHK 


2 






























Env 61 


TTLPCASDAK 


TTLFCASDAK 


121 


9 


19 


41 


4 


1 






ATLFCASDAK 


7 






7 
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TILFCASDAK 
PTLFCASDAK 
xTLFCASDAR 
TTLFCASEAK 


6 
2 
2 




2 


1 











nl lir UASDAR 


2 
2 




1 


2 






Nef 94 


FPVRPQVPL 


FPVRPQVPL 

FPVKPQVPL 


135 
9 


8 
1 


17 
3 


51 
2 


4 


3 






FPVRPQVPV 


4 






2 






Gag 545 


YPLASLRSLF 


EPLTSLKSLF* 


22 






21 






*These tv\ 
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[peptides PrediCted ideological conservation for a panel of HLA-A2 rest: 
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Table 10. 167 HIV-1 Variants 



Sequence 


Name 


Accession 


SubType 


Count 


Designation 




Number 


A.UG.92UG037 U51190 


92UG037 


U51190 


A 


UG 


A.BY.97BL006 AF1932 


97BL006 


AF1 93275 


A 


BY 


A.KE.Q23 AF004885 


Q23 


AF004885 


A 


KE 


A.SE.SE6594 AF06967 


SE6594 


AF069672 


A 


SE 


A.SE.SE7253 AF06967 


SE7253 


AF069670 


A 


SE 


A.SE.SE7535 AF06967 


SE7535 


AF069671 


A 


SE 


A.SE.SE8538 AF06966 


SE8538 


AF069669 


A 


SE 


A.SE.SE8891 AF06967 


SE8891 


AF069673 


A 


SE 


A.UG.U455 M62320 


U455 


M62320 


A 


UG 


A.SE.UGSE8131 AF107 


UGSE8131 


AF1 07771 


A 


SE 


A2.CY.94CY017.41 AF 


94CY017.41 


AF286237 


A2 


CY 


A2.CD.97CDKTB48 AF2 


97CDKTB48 


AF286238 


A2 


CD 


A2D.-.97KR004 AF286 


97KR004 


AF286239 


A2D 


KR 


A2G.CD.97CDKP58 AF3 


97CDKP58 


AF316544 


A2G 


CD 


AC.IN.21301 AF06715 


21301 


AF067156 


AC 


IN 


AC.RW.92RW009 U8882 


92RW009 


U88823 


AC 


RW 


AC.SE.SE9488 AF0714 


SE9488 


AF071474 


AC 


SE 


ACD.SE.SE8603 AF075 


SE8603 


AF075702 


ACD 


SE 


ACG.BE.VI1035 AJ276 


V1 1035 


AJ276595 


ACG 


BE 


AD.SE.SE6954 AF0757 


SE6954 


AF075701 


AD 


SE 


AD.SE.SE7108 AF0714 


SE7108 


AF071473 


AD 


SE 


ADHK.N0.97NOGIL3 AJ 


97NOGIL3 


AJ237565 


ADHK 


NO 


ADK.CD.MAL X04415 


MAL 


X04415 


ADK 


CD 


AG.NG.92NG003 U8882 


92NG003 


U88825 


AG 


NG 


AG.BE.VI1197 AJ2765 


VI1197 


AJ276596 


AG 


BE 


AGHU.GA.VI354 AF076 


VI354 


AF076474 


AGHU 


GA 


AGU.CD.Z321 U76035 


Z321 


U76035 


AGU 


CD 


AJ.BW.BW2117 AF1921 


BW2117 


AF192135 


AJ 


BW 


B.NL.3202A21 U34604 


3202A21 


U34604 


B 


NL 


B.US.BC L02317 


BC 


L02317 


B 


US 


B.GB.CAM1 D10112 


CAM1 


D10112 


B 


GB 


B.DE.D31 U43096 


D31 


U43096 


B 


DE 


B.US.DH123 AF069140 


DH123 


AF069140 


B 


US 


B.GB.GB8.C1 Y13716 


GB8 


AJ271445 


B 


GB 


B.DE.HAN U43141 


HAN 


U43141 


B 


DE 


B.FR.HXB2 K03455 


HXB2 


K03455 


B 


FR 


B.US.JRCSF M38429 


JRCSF 


M38429 


B 


US 


B.GB.MANC U23487 


MANC 


U23487 


B 


GB 


B.US.MNCG M17449 


MNCG 


M17449 


B 


US 


B.GA.OYI, M26727 


OYI 


M26727 


B 


GA 


B.US.P896 U39362 


P896 


M96155 


B 


US 


B.US.RF M17451 


RF 


M17451 


B 


US 


B.CN.RL42 U71182 


RL42 


U71182 


B 


CN 


B.US.SF2 K02007 


SF2 


K02007 


B 


US 


B.TW.TWCYS AF086817 


TWCYS 


AF086817 


B 


TW 


B.AU.VH AF1 46728 


VH 


AF146728 


B 


AU 


B.US.WEAU160 U21135 


WEAU160 


U21135 


B 


US 


B.KR.WK AF224507 


WK 


AF224507 


B 


KR 


B.US.WR27 U26546 


WR27 


U26546 


B 


US 
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B. US.YU2_M93258 
BF1 .BR.93BR029.4_AF 

C. BR.92BR025_U52953 
C.IN.93IN101_AB0238 
C.IN.93IN904_AF0671 
C.IN.93IN999_AF0671 
C.IN.94IN11246_AF06 
C.IN.95IN21068_AF06 
C.BW.96BW0402_AF1 10 
C.BW.96BW1210_AF110 
C.BW.96BW15B03_AF1 1 
C.ET.ETH2220_U4601 6 
C.BW.96BW1 1 B01_AF1 1 
C.BW.00BW0762.1_AF44 
C.BW.00BW0768.20_AF44 
C.BW.00BW0874.21_AF44 
C.BW.00BW1471 .27_AF44 
C. BW.00BW1 6 1 6.2_AF44 
C.BW.00BW1686.8_AF44 
C.BW.00BW1 759.3_AF44 
C.BW.00BW1 773.2_AF44 
C. BW.00BW1 783.5_AF44 
C.BW.00BW1795.6_AF44 
C.BW.00BW1 81 1 .3_AF44 
C.BW.00BW1 859.5_AF44 
C.BW.00BW1880.2_AF44 
C.BW.00BW1 921 .1 3_AF44 
C. BW.00BW2036. 1 _AF44 
C.BW.00BW2063.6_AF44 
C.BW.00BW2087.2_AF44 
C.BW.00BW2127.214_AF44 
C.BW.00BW21 28.3_AF44 
C.BW.00BW2276.7_AF44 
C.BW.0OBW381 9.3_AF44 
C.BW.00BW3842.8_AF44 
C. BW.00BW3871 .3_AF44 
C. BW.00BW3876.9_AF44 
C. BW.00BW3886.8_AF44 
C.BW.00BW3891 .6_AF44 
C.BW.00BW3970.2_AF44 
C.BW.00BW5031 .1_AF44 
C.BW.96BW01 B21_AF1 1 
C.BW.96BW0407_AF1 1 
C.BW.96BW0502_AF1 1 
C.BW.96BW06.J4_AF29 
C.BW.96BW1 1 .06_AF1 1 
C.BW.96BW1210_AF11 
C.BW.96BW15B03_AF1 1 
C.BW.96BW16.26_AF1 1 
C.BW.96BW17A09_AF1 1 
C.BW.96BWM01 .5_AF44 
C.BW.96BWM03.2_AF44 
C.BW.98BWMC12.2_AF44 
C.BW.98BWMC1 3.4_AF44 
C.BW.98BWMC14.a3_AF44 
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orx i f 


Mr i f yoDo 


PRC1 1" 

ur\r i l_ 


_cpx 


PD 


H PD ft47R0ft^ I Jftftft99 


ft47Rnft^ 


I Iftftft99 


u 




pn 
ou 


D UG 94UG1 141 U8889 


Q4I IG1 141 
y*tvJO 1 l*r I 


I Iftftft94 


n 
u 




1 ip 

UO 


n pn fi i ko^4^4 


Fl 1 


k'O'^A^/l 
l\UO £ fO*f 


n 
u 




pn 


D CD NDK M27323 

L/.VL/.IMUI\ IVIZ- / 




M97^9^ 
IVI^f 0£.0 


D 




pn 
uu 


F1 BR 93BR020 1 AFO 


Q^RR09n 1 


AF00*S4Q4 


F1 




RR 

Drv 


F1 Fl FIN9363 AF075 


FINQ^fi^ 


AF07^70^ 
Mi U/ Or UO 


F1 




pi 

r 1 


F1 FR MP41 1 AJ24923 

i i.i ix.ivii ~ i i rvu&i^wbU 


MP41 1 * 


A l94Q9^ft 

MO 449^.00 


F1 




FR 

rrv 


F1 BE VI850 AF07733 


v/ift^n 


AF077^aR 

Ml Uf 1 OOO 


F1 




RF 
DC 


F2 CM MP257 AJ24923 


MP9S7 


A I94Q9a7 


F2 




PM 

Ol VI 


F2KUBEVI1126 AF07 

1 £>I\V< VII I £.V^ /VI v/ f 


VI1 19R 

VIII £-\J 


AF07R47 c i 

Ml U 1 U*T f O 


F2KU 


RF 

DC 


G NG 92NG083 U88826 

• 1 NViWfal xwwUv UUUUfaV 


QPNGfJft^ 


I Jftftft9R 


G 




NG 
i\o 


G.BE.DRCBL AF084936 


DRCBL 


AF084936 


G 




BE 


G.SE.SE6165 AF06164 


SE6165 


AF061642 


G 




SE 


H.CF.90CF056* AF0054 


90CF056 


AF005496 


H 




CF 


H.BE.VI991 AF1 90127 


VI991 


AF190127 


H 




BE 


H.BE.VI997 AF190128 


VI997 


AF190128 


H 




BE 


J.SE.SE7022 AF08239 


SE7022 


AF082395 


J 




SE 


J.SE.SE7887 AF08239 


SE7887 


AF082394 


J 




SE 
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K CD FOTR11C AJ2492 


FOTR1 1 




l\ 




K CM MP535 AJ249239 


MP535 


A I24Q9^Q 


IX 




N.CM.YBF30 AJ006022 


YBF30 


AJ006022 


N 


CM 


O.SN.99SE-MP1299 ZX 


SEMP1299 


AJ302646 


O 


SN 


O.SN.99SE-MP1300 ZX 


SEMP1300 


AJ302647 


0 


SN 


O.CM.ANT70 L20587 


ANT70 


L20587 


O 


CM 


O.CM.MVP5180 L20571 


MVP5180 


L20571 


O 


CM 


U.CD.-.83CD0031 


83CD0031 


AF286236 


U 


CD 
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Table 11. HIV Gag Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga 1.1 



Name : 


00BW0762_1 


Len 


: 556 


Name : 


00BW0768_2 


Len 


: 556 


Name : 


00BW0874_2 


Len 


: 556 


Name : 


00BW1471_2 


Len 


: 556 


Name : 


00BW1616 2 


Len 


: 556 


Name : 


00BW1686_8 


Len 


: 556 


Name : 


0OBW1759_3 


Len 


: 556 


Name : 


00BW1773_2 


Len 


556 


Name : 


00BW1783_5 


Len 


: 556 


Name ; 


00BW1795_6 


Len 


: 556 


Name : 


0OBW1811_3 


Len 


: 556 


Name : 


00BW1859_5 


Len 


: 556 


Name : 


00BW1880_2 


Len 


: 556 


Name : 


00BW1921_1 


Len 


556 


Name : 


00BW2 036 1 


Len 


: 556 


Name : 


00BW2063_6 


Len 


: 556 


Name : 


00BW2087_2 


Len 


: 556 


Name : 


00BW2127_2 


Len 


: 556 


Name: 


00BW2128_3 


Len 


556 


Name: 


00BW2276_7 


Len 


: 556 


Name: 


00BW3 819_3 


Len 


: 556 


Name : 


O0BW3842^8 


Len 


: 556 


Name : 


O0BW3871_3 


Len 


: 556 


Name : 


00BW3 876_9 


Len 


: 556 


Name : 


00BW3 886_8 


Len 


556 


Name : 


00BW3891_6 


Len 


: 556 


Name : 


0OBW3970_2 


Len 


: 556 


Name: 


00BW5031_1 


Len 


: 556 


Name : 


96BW01B21 


Len 


: 556 


Name : 


96BW0407 


Len 


; 556 


Name : 


96BW0502 


Len 


: 556 


Name : 


96BW06_J4 


Len 


: 556 


Name : 


96BW11_06 


Len 


: 556 


Name : 


96BW1210 


Len 


: 556 


Name : 


96BW15B03 


Len 


: 556 


Name : 


96BW16_26 


Len 


: 556 


Name: 


96BW17A09 


Len 


: 556 


Name : 


96BWM01_5 


Len 


: 556 


Name : 


96BWMQ3_2 


Len 


: 556 


Name : 


98BWMC12J2 


Len 


: 556 


Name: 


98BWMC13_4 


Len 


556 


Name : 


98BWMC14_a 


Len 


: 556 


Name : 


98BWM014_1 


Len 


556 


Name : 


98BWM018_d 


Len 


556 


Name : 


98BWM036_a 


Len 


556 


Name: 


98BWM037_d 


Len- 


556 


Name: 


99BW3932__1 


Len: 


556 


Name : 


99BW4642_4 


Len. 


556 


Name: 


99BW4745_8 


Len 


556 


Name : 


99BW4754_J7 


Len: 


556 


Name : 


99BWMC16__8 


Len: 


556 


Name: 


A2_CD_97CD 


Len: 


556 


Name : 


A2__CY_94CY 


Len: 


556 


Name : 


A2D 97KR 


Len: 


556 


Name : 


A2G_CD_ 97C 


Len: 


556 


Name : 


AJBY_97BL0 


Len: 


556 


Name : 


A_KE_Q23_A 


Len: 


556 



PCT/US2004/009510 



Check: 


2513 


Weight 


: 1.00 


Check: 


8965 


Weight 


: 1.00 


Check: 


9574 


Weight 


: 1 . 00 


Check: 


5395 


Weight 


: 1.00 


Check: 


4692 


Weight 


: 1.00 


Check : 


7822 


Weight 


: 1 . 00 


Check : 


7777 


Weight 


: 1.00 


Check: 


9727 


Weight 


: 1.00 


Check : 


9681 


Weight 


: 1.00 


Check: 


9667 


Weight 


: 1.00 


Check : 


4422 


We ight 


: 1 . 00 


Check : 


7320 


Weight 


: 1 . 00 


Check : 


1603 


Weight 


: 1.00 


Check: 


883 


Weight : 


1.00 


Check: 


2591 


Weight 


: 1.00 


Check : 


5152 


Weight 


: 1.00 


Check : 


5183 


Weight 


: 1.00 


Check : 


5469 


Weight 


: 1.00 


Check: 


9621 


Weight 


: 1.00 


Check : 


4153 


Weight 


: 1.00 


Check: 


4227 


Weight 


: 1.00 


Check: 


9312 


Weight 


1.00 


Check : 


501 


Weight : 


1 . 00 


Check : 


773 


Weight : 


1 . 00 


Check : 


2351 


Weight 


: 1.00 


Check : 


129 


Weight : 


1 . 00 


Check : 


8768 


Weight 


: 1.00 


Check : 


3966 


Weight 


; 1.00 


Check : 


602 


Weight : 


1 . 00 


Check : 


9836 


Weight 


: 1.00 


Check : 


6402 


We ight 


1.00 


Check : 


254 


Weight : 


1 . 00 


Check : 


6801 


Weight 


1 . 00 


Check : 


6016 


Weight : 


1 . 00 


Check : 


6072 


Weight . 


1.00 


Check : 


9409 


Weight 


1 . 00 


Check : 


2723 


Weight : 


1 . 00 


Check : 


5051 


Weight : 


1 . 00 


Check: 


496 


Weight : 


1 . 00 


Check : 


1164 


Weight : 


1 . 00 


Check: 


4961 


Weight : 


1 . 00 


Check : 


7351 


Weight : 


1. 00 


Check : 


288 


Weight : 


1.00 


Check : 


6836 


Weight : 


1.00 


Check : 


4386 


Weight : 


1.00 


Check : 


6900 


Weight : 


1.00 


Check : 


292 


Weight : 


1.00 


Check : 


1347 


Weight: 


1.00 


Check : 


7980 


Weight : 


1.00 


Check : 


9892 


Weight: 


1.00 


Check : 


4279 


Weight : 


1.00 


Check : 


9920 


Weight: 


1.00 


Check : 


3070 


Weight : 


1.00 


Check: 


6350 


Weight : 


1.00 


Check : 


856 


Weight : 


1.00 


Check : 


9315 


Weight : 


1.00 


Check : 


2442 


Weight : 


1.00 
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Name: 


A_SE_SE659 


Len: 


556 


Name : 


A_SE_SB725 


Len: 


556 


Name: 


A__SE_SE753 


Len: 


556 


Name : 


A_SE_SE853 


Len: 


556 


Name: 


A_SE_SE889 


Len: 


556 


Name : 


A_SE_UGSE8 


Len: 


556 


Name: 


A_UG_92UG0 


Len: 


556 


Name: 


A_UG_JJ455 


Len: 


556 


Name : 


AC_IN_213 0 


Len: 


556 


Name : 


AC_RW_92RW 


Len: 


556 


Name : 


AC_SE_SE94 


Len: 


556 


Name : 


ACD_SE_SE8 


Len: 


556 


Name : 


ACG_BE_VI1 


Len: 


556 


Name : 


AD_SE_SE69 


Len: 


556 


Name : 


AD_SE_SE71 


Len: 


556 


Name : 


ADHK_NO_97 


Len: 


556 


Name : 


ADK_CD_MAL 


Len: 


556 


Name 


AG_BE_VI11 


Len: 


556 


Name : 


AG_NG 92NG 


Len: 


556 


Name : 


AGHUJ3A_VI 


Len: 


556 


Name : 


AGU_CD_Z32 


Len: 


556 


Name : 


AJ_BW_BW21 


Len: 


556 


Name : 


B_AU_VH_AF 


Len: 


556 


Name : 


B_CN_RL42__ 


Len: 


556 


Name : 


B_DE_D31_U 


Len: 


556 


Name : 


B_DE_HAN_U 


Len: 


556 


Name : 


B_FR__HXB2__ 


Len: 


556 


Name : 


B_GA_OYI 


Len: 


556 


Name : 


B_GB_CAM1_ 


Len: 


556 


Name : 


B_GB_GB8_A 


Len: 


556 


Name : 


B_GB_MANC_ 


Len: 


556 


Name: 


B__KR_WK_AF 


Len: 


556 


Name: 


B_NL_32 02A 


Len: 


556 


Name : 


B_TWJTWCYS 


Len: 


556 


Name: 


B_US_BC_L0 


Len: 


556 


Name : 


B_JJS_DH123 


Len: 


556 


Name : 


B_US_JRCSF 


Len: 


556 


Name : 


B_US_MNCG_ 


Len: 


556 


Name : 


B_US_P896_ 


Len: 


556 


Name : 


B_US_RF_M1 


Len: 


556 


Name : 


B__US_SF2_K 


Len: 


556 


Name: 


B_US_WEAU1 


Len: 


556 


Name : 


B_US__WR27__ 


Len: 


556 


Name : 


B_US_YU2_M 


Len: 


556 


Name : 


BF1JBR_93B 


Len : 


556 


Name: 


C_BR_92BR0 


Len: 


556 


Name: 


C_BW__96BW0 


Len: 


556 


Name : 


C_BW_96BW1 


Len: 


556 


Name : 


C_BW_96BW1 


Len: 


556 


Name : 


C_BW_96BW1 


Len : 


556 


Name : 


C_ETJ5TH22 


Len: 


556 


Name : 


C_IN_93IN1 


Len: 


556 


Name : 


C_IN_93IN9 


Len: 


556 


Name : 


C_IN_93IN9 


Len: 


556 


Name : 


C__IN_94IN1 


Len: 


556 


Name : 


C_IN_95IN2 


Len: 


556 


Name : 


CRF01_AE__C 


Len: 


556 


Name : 


CRF01_AE_C 


Len: 


556 


Name : 


CRF01_AE__C 


Len: 


556 


Name : 


CRF01_AE_T 


Len: 


556 


Name : 


CRF01_AE__T 


Len: 


556 



PCT/US2004/009510 



Check : 


8612 


Weight : 


1.00 


Check: 


8315 


Weight: 


1.00 


Check : 


2915 


Weight : 


1.00 


Check : 


9112 


Weight : 


1. 00 


Check: 


8732 


Weight: 


1. 00 


Check : 


8696 


Weight : 


1.00 


Check: 


6290 


Weight : 


1.00 


Check : 


164 


Weight : 


1.00 


Check: 


8482 


Weight : 


1. 00 


Check: 


977 


Weight : 


1.00 


Check : 


8752 


Weight : 


1. 00 


Check: 


9655 


Weight : 


1. 00 


Check : 


3777 


Weight : 


1. 00 


Check: 


732 


Weight : 


1.00 


Check : 


8506 


Weight : 


1. 00 


Check : 


5257 


Weight : 


1. 00 


Check: 


5301 


Weight : 


1.00 


Check : 


1610 


Weight : 


1.00 


Check : 


4188 


Weight : 


1.00 


Check : 


8242 


Weight : 


1.00 


Check : 


2601 


Weight : 


1.00 


Check : 


8389 


Weight : 


1.00 


Check: 


9166 


Weight : 


1.00 


Check : 


3865 


Weight : 


1.00 


Check : 


9464 


Weight : 


1.00 


Check: 


6999 


Weight : 


1.00 


Check : 


141 


Weight : 


1.00 


Check: 


7578 


Weight: 


1.00 


Check : 


8023 


Weight : 


1.00 


Check: 


2675 


Weight : 


1.00 


Check: 


8961 


Weight : 


1.00 


Check : 


7372 


Weight : 


1.00 


Check : 


7117 


Weight : 


1.00 


Check : 


8767 


Weight: 


1.00 


Check : 


7091 


Weight : 


1.00 


Check : 


5049 


Weight : 


1.00 


Check : 


975 


Weight : 


1.00 


Check: 


688 


Weight : 


1.00 


Check : 


8809 


Weight : 


1.00 


Check: 


9306 


Weight : 


1.00 


Check : 


9799 


Weight : 


1.00 


Check : 


9636 


Weight : 


1.00 


Check: 


3349 


Weight: 


1.00 


Check: 


8828 


Weight : 


1.00 


Check : 


7935 


Weight: 


1.00 


Check: 


4429 


Weight : 


1.00 


Check: 


1550 


Weight : 


1.00 


Check : 


7158 


Weight: 


1.00 


Check: 


6016 


Weight : 


1.00 


Check : 


6072 


Weight : 


1.00 


Check : 


4314 


Weight : 


1.00 


Check : 


6959 


Weight: 


1.00 


Check: - 


9362 


Weight : 


1.00 


Check : 


3298 


Weight: 


1.00 


Check: 


6744 


Weight.: 


1.00 


Check : 


8559 


Weight: 


1.00 


Check: 


4763 


Weight : 


1.00 


Check: 


4315 


Weight : 


1.00 


Check : 


3920 


Weight : 


1.00 


Check: 


5074 


Weight : 


1.00 


Check : 


4026 


Weight : 


1. 00 
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Name : CRFO 1__AE_T Len : 

Name : CRF01_AE_JT Len : 

Name : CRF 0 1_AE_T Len : 

Name : CRF 0 1_AE_T Len : 

Name : CRF02_AG_F Len : 

Name : CRF02_AG_F Len : 

Name : CRF 0 2_AG_G Len : 

Name : CRF 0 2_AG_N Len : 

Name : CRF 0 2 _AG_S Len : 

Name : CRFO 2__AG__S Len : 

Name : CRF03_AB_R Len : 

Name : CRF 0 3_AB_R Len : 

Name : CRF 0 4_cpx_ Len : 

Name : CRF04_cpx_ Len : 

Name : CRF04_cpx__ Len : 

Name : CRF 0 5_DF_B Len : 

Name : CRF05_DF_B Len : 

Name : CRFO 6_cpx_ Len : 

Name : CRFO 6_cpx_ Len : 

Name : CRF 0 6_cpx_ Len : 

Name : CRF 0 6__cpx__ Len : 

Name : CRF1 l_cpx_ Len : 

Name : CRF1 l__cpx_ Len : 

Name : D_CD_84ZR0 Len : 

Name : D_CD__ELI_K Len : 

Name: D_CD__NDK__M Len: 

Name: D_UG_94UG1 Len: 

Name: F1_BE_VI85 Len: 

Name: F1_BR_ 93BR Len: 

Name : F1_FI_ FIN9 Len : 

Name : F1_FR_MP4 1 Len : 

Name : F2_CM_MP2 5 Len : 

Name : F2KU__BE__VI Len : 

Name : G_BE__DRCBL Len : 

Name : G_NG_92NG0 Len : 

Name : G_SE_SE616 Len: 

Name: H_BE_VI991 Len: 

Name: HJBE_VI997 Len: 

Name: H_CF_90CF0 Len: 

Name : J_SE_JSE7 02 Len : 

Name: J_SE_SE788 Len: 

Name: K_CD_EQTB1 Len: 

Name : K_CMJVIP53 5 Len : 

Name: N_CM_YBF30 Len: 

Name: O_CM__ANT70 Len: 

Name : 0_CM__MVP5 1 Len : 

Name: 0_SN_99SE_ Len: 

Name : 0_SN__99SE_ Len : 

Name: U CD 83C Len: 



// 



556 


Check: 


2981 


Weight 


: 1 


.00 


556 


Check : 


6648 


Weight 


: 1 


. 00 


556 


Check : 


6948 


Weight 


: 1 


. 00 


556 


Check : 


6129 


Weight 


: 1 


. 00 


556 


Check : 


8134 


Weight 


1 


. 00 


556 


Check: 


8925 


Weight 


: 1 


.00 


556 


Check : 


9991 


Weight 


: 1 


. 00 


556 


Check : 


9388 


Weight 


: 1 


.00 


556 


Check: 


9371 


Weight 


: 1 


. 00 


556 


Check: 


3289 


Weight 


: 1 


. 00 


556 


Check: 


5554 


Weight 


: 1 


. 00 


556 


Check : 


5307 


Weight 


: 1 


. 00 


556 


Check : 


5488 


Weight 


: 1 


. 00 


556 


Check : 


6046 


Weight 


: 1 


. 00 


556 


Check : 


3807 


Weight 


: 1 


. 00 


556 


Check: 


6702 


Weight 


1 


. 00 


556 


Check: 


5563 


Weight 


: 1 


. 00 


556 


Check : 


8164 


Weight 


: 1 


. 00 


556 


Check : 


1975 


Weight 


: 1 


. 00 


556 


Check: 


1486 


Weight 


: 1 


. 00 


556 


Check: 


4050 


Weight 


: 1 


. 00 


556 


Check: 


1515 


Weight 


: 1 


.00 


556 


Check: 


9474 


Weight 


: 1 


.00 


556 


Check : 


4739 


Weight 


s 1 


. 00 


556 


Check: 


8533 


Weight 


1 


.00 


556 


Check: 


6100 


Weight : 


1 


.00 


556 


Check: 


7681 


Weight : 


1 


. 00 


556 


Check : 


8540 


Weight : 


1 


. 00 


556 


Check: 


241 


Weight : 


1.00 


556 


Check : 


7510 


Weight : 


1 


00 


556 


Check: 


9846 


Weight: 


1. 


00 


556 


Check : 


1247 


Weight : 


1. 


00 


556 


Check : 


2186 


Weight : 


1. 


00 


556 


Check: 


8049 


Weight : 


1. 


00 


556 


Check: 


254 


Weight : 


1.00 


556 


Check: 


6407 


Weight : 


1. 


00 


556 


Check : 


808 


Weight : 


1.00 


556 


Check: 


4749 


Weight : 


1. 


00 


556 


Check: 


4718 


Weight : 


1. 


00 


556 


Check : 


2564 


Weight : 


1. 


00 


556 


Check: 


2692 


Weight : 


1. 


00 


556 


Check : 


6586 


Weight : 


1. 


00 


556 


Check: 


3803 


Weight : 


1. 


00 


556 


Check: 


8603 


Weight : 


1. 


00 


556 


Check: 


365 


Weight : 


1.00 


556 


Check: 


9160 


Weight : 


1. 


00 


556 


Check : 


9380 


Weight : 


1. 


00 


556 


Check: 


8796 


Weight: 


1. 


00 


556 


Check : 


4633 


Weight : 


1. 


00 



00BW0762_1 
0OBWO768_2 
00BW0874_2 
00BW1471_2 
00BW1616_2 
00BW1686_8 
00BW1759_3 
00BW1773 2 



1 

MGARASILRG 
MGARASVLRG 
MGARASILRG 
MGARASILRG 
MGARASILRG 
MGARASILRG 
MGARASILRG 
MGAS AS I LRG 



.EKLDKWEKI 
. EKLDKWEKI 
. GKLDTWEKI 
. GKLDTWEKI 
. GKLDEWEKI 
. GKLDTWEKI 
. GKLDKWERI 
. GKLDKWEKI 



RLRPGGRKHY 
RLRPGGKKHY 
RLRPGGKKQY 
RLRPGGKKHY 
RLRPGGKKRY 
RLRPGGKKHY 
RLRPGGKKHY 
RLRPGGKKKY 



MIKHIVWASR 
MLKHLVWASR 
MIKHLVWASR 
MLKHLVWASR 
MMKHLVWASR 
MIKHLVWASR 
MLKHLVWASR 
RLKHLVWASR 



50 

ELERFALNPG 
ELERFALNPG 
ELERFALNPG 
ELERFALNPG 
ELERFALNPG 
ELERFALNPG 
ELERFALNPG 
ELERFALNSG 



141 



WO 2005/012502 



PCI7US2004/009510 



00BW1783_5 
00BW1795_6 
00BW1811_3 
00BW1859_5 
00BW1880_2 
00BW1921_1 
OOBW2 036_1 
00BW2063_6 
0OBW2O87_2 
00BW2127_2 
00BW2128_3 
00BW2276_7 
00BW3819_3 
00BW3 842_8 
00BW3871_3 
00BW3 876_9 
00BW3 886_8 
00BW3891_6 
00BW3970_2 
00BW5031_1 
96BW01B21 
96BW0407 
96BW0502 
96BW06__J4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_26 
96BW17A09 
96BWM01_5 
96BWM03_2 
98BWMC12_2 
98BWMC13_4 
98BWMC14_a 
98BWM014_1 
98BWMO!8_d 
98BWM036_a 
98BWM037_d 
99BW3 932__1 
99BW4642_4 
99BW4745_8 
99BW4754_7 
99BWMC16_8 
A2_CD_97CD 
A2_CY_94CY 
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00BW0762_1 AGTTSTLQEQ IAWMTS.NPP VPVGEIYKRW I ILGLNKIVR MYSPVSI^ 
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00BW2036_1 AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW IILGLNKIVR SsP^SlSl 
O0BW2063_6 AGTTSTLAEQ IAWMTS.NPP IPVGDIYKRW IILGLNKIVR 
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00BW3842_8 AGTTSNLQEQ ITWMTS.NPP IPVGEIYKRW IVLGLNKIVR SvllSl 
O0BW3871_3 AGTTSTLQEQ IDWMTN.NPP IPVGEIYKRW IILGLNKIW mroira 
00BW3876_9 AGTTSTLQEQ IAWMTS.NPP VPVGEIYKRW I ILGLNKIVR SvllSS 
00BW3886_8 AGTTSTLQEQ IAWMTN.NPP VPVGDIYKRW I ILGLNKIVR SKvsSt 
00BW3891_6 AGTTSNLQEQ INWMTA.NPP IPVGDIYKRW IILGlSiw MYS^SILlS 
00BW3970_2 AGTTSTLQEQ IAWMTN.NPP VPVGDIYKRW IVLGLiSlVR SvllSS 
00BW5031_1 AGTTSTLQEQ IAWMTS.NPP VPVGDIYKRW I ILGLNKIVR S^IL^ 
96BW01B21 AGTTSNLQEQ IAWMTA.NPP VPVGDIYKRW IILGLNKIVR S^SILM 
96BW0407 AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW IILGLNKIvS S^SSl 
96BW0502 AGATSTLQEQ IAWMTS.NPP VPVGDIYKRW IILGLNKITO StoIL^ 
96BW06.J4 AGTTSTLQEQ IGWMTH.NPP IPVGDIYKRW I ILGLNKIVR SKvSSl 
96BW11_06 AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW I ILGLNKIVR SsP^sSS^ 
«J2?Si° AGTTSNL Q E Q INWMTS.NPP IPVGDIYKRW IILGLNKIVR~ 
96BW15B03 AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW IILGLNKIVR MYSPVSILM 
ItZ^lt ^ TTSTLQE Q IAWMTS.NPP VPVGEIYKRW III^LnSS £££££ 
96BW17A09 AGTTSTLQEQ IAWMTN.NPP IPVGDIYKRW I TMGLNKI VR MYSPVSILDI 
96BWMO!_5 AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW IILGLNKIVR SSSSS 
96BWM03.2 AGSTSTLQEQ IAWMTS.NPP VPVGDIYKRW I ILGLNKIVR SvIlSS 
98BWMC12_2 AGSTSNLQEQ IAWMTS.NPP IPVGDIYKRW I ILGLNKIVR SvlSl 
98BWMC13_4 AGTTSTLQEQ IAWMTR.NPP VPVGEIYKRW IILGLNKIVR SvsIlSi 
98BWMC14_a AGTTSTLQEQ VGWMTS.NPP IPVGDIYKRW IILGLNKIVR SvSSl 
98BWMO!4_l AGTTSTLQEQ IAWMTS.NPP VPVGEIYKRW I ILGLNKIVR M^PV^lS 
98BWMO!8_d AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW IILGLNKIVR SSwKSS 
98BWM036_a AGTTSTLQEQ IAWMTS.NPP IPVGDIYKRW IIMGLNKIVr" SvlSl 
98BWM037_d AGTNSTLQEQ IAWMTN.NPP VPVGDIYKRW I ILGLNKIVR SvllSl 
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99BW4745.8 AGTTSTLQEQ INWMTG.NPP IPVGDIYKRW IIMGLNKIvl M^SPvISd^ 
99BW4754_7 AGTTSTLQEQ ITWMTN.NPP IPVGEIYKRW I ILGLNKIVR SsPvlSl 
99BWMC16_8 AGTTSTLQEQ IAWMTS.NPP VPVGEIYKRW IILGLNKIVR JSSSSSS 
A2_CD_97CD AGATSNLQEQ IGWMTS.NPP IPVGEIYKRW I ILGLNKIVR SvsSSl 
A2_CY_94CY AGTTSTLQEQ IGWMTS.DPP IPVGEIYKRW IILGLNKIVR SsSSSS 
™jjS 9 _l™ AGTTS ^ IGWMTS.NPP VPVGEIYKRW IILgSSw 
A_BY_97BL0 AGTTSTLQEQ IGXMTs!nPP IPXGDIYKRX I ILGLNKIVR MYSPVSILDT 
AlSlSkt ^WMTG.NPP IPVGDIYKRW IILgSSS SSSSS 
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0OBW5O31_l ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQANN INIMMOR 
96BW01B21 ALGPGASIEE MMTACQGVGG PSHKARVLAE AMSQANS 
96BW0407 ALGPGASI.ee MMTACQGVGG PSHKARVLGE AMSQaVn 
96BW0502 ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQANS '. 
96BW06_J4 ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSQANS 

96BW11_06 TLGPGASLEE MMTACQGVGG PSHKARILAE AMSQANN SEX 
96BW1210 ALGPGASLEE MMTACQGVGG . PSHKARVLAE AMSHAGN JSSS 
96BW15B03 ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQANS m Z 
96BW16_2 6 ASGPGASLEE MMTACQGVGG PGHKARVLAE AMSQaS: SSS 
96BW17A09 ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSHANN TSIMMOK 
96BWM01_5 ALGPGASLEE MMTACQGVGG PSHKARVLAE AMSQANN S2k 
96BWM03_2 ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQaT 2K2S 
98BWMC12_2 ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSHTN TIZol 
98BWMC13_4 ALGPGASLEE MMTACQGVGG PGHKARVLAE AMSQANN InSS 
98BWMC14_a SLGTGATLEE MMTACQGVGG PSHKARVLAE AMsST SSS 
98BWM014_1 ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSQANS S5SX 
98BWMO!8_d ALGPGASLEE MMTACQGVGG PSHKARVLAE AMSQAGN S5££ 
98BWM036_a ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQANN mZ 
98BWM037_d ALGPGASLEE MMTACQGVGG PGHKARVLAE AMSqW ZS 
99BW3932.1 ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSQANN SSSS 
99BW4642_4 ALGPGATLEE MMTACQGVGG PGHKARVLAE AMGQVNS tSX 
99BW4745_8 GLGPGATLEE MMTACQGVGG PSHKARVLAE AMSQTN 55SX 
99BW4754_7 ALGPGASLEE MMTACQGVGG PSHKARVLAE AMSQANN SSSX 
99BWMC16_8 ALGPGASIEE MMTACQGVGG PSHKARVSAE AMSQANQ 2SX 
A2_CD_97CD ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQVQNT 

A2 CY_94CY ALGPGASLEE MMTACQGVGG PSHKARVLAE AMSHVQSTn! "t' 2 

^ GPGASLBE MMTACQGVGG ^HKARVLAE AMSQVQNTN . . !snS!qk 
A_BY_97BL0 ALGSEATLEE MMTACQGVGG PGHKAXVXAE AMSQVQN ANII^OK 

tS-sE 3 6 rf ^ ACQGVGG pghkarv ^ amsqvSh: : : : : :SS5 

A_SE_SE725 . . . ... . 

AIDSES" ALGAGATLEE MMTACQGVGG ^HKARVLAE AMSQVQh! '. '. '. PNIMMQR 

A_SE_SE889 .' 

A_SE_UGSE8 GLGAGATLEE MMTACQGVGG PSHKARVLAE AMSQAQQ TNII^OR 
A_UG_92UG0 ALGAGATLEE MMTACQGVGG PGHKARVLAE AMSQVQ^ SSJX 
A_UG_U455_ ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQVQQT slZil 

AC_IN_2130 ALGPGASLEE MMTACQGVGG PSHKARVLAE AMSQTN NsSJX 

AcISElIS! ^CQGVGG PGHKARVLAE AMSQVQq! ! ! .* iSSJg 

ACD_SE_SE8 ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQAN." .* ! ' 'aNTAIMMOR 
ACG_BE_VI1 ALGPGASLEE MMTACQGVGG PSHKARVLAE AMSQANN 

A^:sfcsE7X ^ lG ^ AA f^' EE ^^T^^^yTT PGHKARVLAE AMSQATNn! ! ! INAaIS 
ADHK_NO_97 ALGPGATLEE MMTACQGVGG PGHKARVLAE AMSQVQN ! VSVMlioR 

ADK_CD_MAL ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSQATN 'stUSr 

AG_BE_VI11 ALGTGATLEE MVTACQGVGG PGHKARVLAE AMSQVEI qSSS 

AG_NG_92NG ALGAGATLEE MLTACQGVGG PSHKARVLAE AMSRAT G* "tSSSSX 
AGHU_GA_VI GLGAGATLEE MMTACQGVGG PGHKARVLAE AMSQVHN 

AGU_CD_ Z 32 ALGPGATLEE MMTACQGVGG PSHKARVLAE AMSQaTs'' TjJSSS 
AJ_BW_BW21 ALGAGATLEE MMTACQGVG. PGHKAKILAE AMSQVQN 

B_AU_VH_AF ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQVT ' NsSSoR 

B_CN_RL42_ ALGPAATLEE MMTACQGVGG PSHKARILAE AMSQVTNSAI tSsaSS 

B_DE_D31_U ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQvT ^AT^QR 
B_DE_HAN_U ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQVT SISSX 
B_PR_HXB2_ ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQVT So 
B_GA_OYI_ ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQvS S^vS 
B_GB_CAM1_ ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQVT NsISoR 
B_GB_GB8_A ALGPAATLEE MMTACQGVGG PGHKARVLAE AMSQVN T£SK 

B_GB_MANC_ ALGPAATLEE MMTACQGVGG PSHKARILAE AMSQVT ! '. '. '. InSaSX 
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B_KR_WK_AF ALGPGATLEE MMTACQGVGG 

B_NL_3202A ALGPGATLEE MMTACQGVGG 

B_TW_TWCYS ALGPAATLEE MMTACQGVGG 

B_US_BC_L0 ALGPGATLEE MMTACQGVGG 

B_US_DH123 ALGPGATLEE MMTACQGVGG 

B_US_JRCSF ALGPAATLEE MMTACQGVGG 

B_US_MNCG__ ALGPAATLEE MMTACQGVGG 

B_US_P896_ ALGPGATLEE MMTACQGVGG 

B_US_RF_M1 ALGPAATLEE MMTACQGVGG 

B_US_SF2_K ALGPAATLEE MMTACQGVGG 

B_US_WEAU1 ALGPGATLED MMTACQGVGG 

B_US__WR27_ ALGPGATLEE MMTACQGVGG 

B_US_YU2_M ALGPAATLEE MMTACQGVGG 

BF1_BR_93B ALGPAATLEE MMTACQGVGG 

C_BR_92BR0 ALGPGASLEE MMTACQGVGG 

C_BW_96BW0 ALGPGASLEE MMTACQGVGG 

C_BW_96BW1 ALGPGASLEE MMTACQGVGG 

C_BW_96BW1 ALGPGASLEE MMTACQGVGG 

C_BW_96BW1 ALGPGATLEE MMTACQGVGG 

C_ET_ETH22 ALGPGASLEE MMTACQGVGG 

C_IN_93IN1 ALGPGASLEE MMTACQGVGG 

C_IN__93IN9 ALGPGASLEE MMTACQGVGG 

C_IN_93IN9 ALGPGASLEE MMTACQGVGG 

C_IN_94IN1 ALGPGASLEE MMTACQGVGG 

C_IN_95IN2 ALGPGASLEE MMTACQGVGG 

CRF01_AE_C ALGTGATLEE MMTACQGVGG 

CRF01_AE_C ALGTGATLEE MMTACQGVGG 

CRF01_AE_C ALGTGATLEE MMTACQGVGG 

CRF01_AEJT SLGTGATLEE MMTACQGVGG 

CRF01_AE_T ALGTGATLEE MMTACQGVGG 

CRFO 1_AE JT ALGTGATLEE MMTACQGVGG 

CRF01_AE_T ALGTGATLEE MMTACQGVGG 

CRF01_AEJT ALGTGATLEE MMTACQGVGG 

CRF01_AE_T ALGTGATLEE MMTACQGVGG 

CRF02_AG_F ALGPGATLEE MMTACQGVGG 

CRF02_AG_F ALGPGASLEE MMTACQGVGG 

CRF02_AG_G ALGPGATLEE MMTACQGVGG 

CRF02_AG_N ALGTGATLEE MMTACQGVGG 

CRF02_AG_S ALGPGATLEE MMTACQGVGG 

CRF02_AG_S ALGPGATLEE MMSACQGVGG 

CRF03_ABJR ALGSGATLEE MMTVCQGVGG 

CRF03_AB_R ALGSGATLEE MMTACQGVGG 

CRF04_cpx_ ALGTGATLEE MMTACQGVGG 

CRF04_cpx_ ALGTGATLEE MMTACQGVGG 

CRF04_cpx_ ALGTGATLEE MMTACQGVGG 

CRF05_DFJB ALGQQATLEE MMTACQGVGG 

CRF05_DF_B ALGPQATLEE MMTACQGVGG 

CRF06_cpx_ ALGPGATLEE MMTACQGVGG 

CRF06_cpx__ ALGPGATLEE MMTACQGVGG 

CRF06_cpx_ ALGPGATLEE MMTACQGVGG 

CRF06_cpx_ ALGPGATLEE MMTACQGVGG 

CRFll_cpx_ ALGTGATLEE MMTACQGVGG 

CRFll_cpx_ ALGVGATLEE MMTACQGVGG 

D_CD_84ZR0 ALGPGATLEE MMTACQGVGG 

D_CD_ELI_K ALGPQATLEE MMTACQGVGG 

D_CD_NDK_M ALGPQATLEE MMTACQGVGG 

D_UG_94UG1 ALGPAATLEE MMTACQGVGG 

F1_BE_VI85 ALGTGATLEE MMTACQGVGG 

F1JBR_93BR ALGPGATLEE MMTACQGVGG 

F1_FI_FIN9 GLGIGATLEE MMTACRGVGG 

F1_FR_MP41 ALGPGATLEE MMTACQGVGG 
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PSHKAKVLAE AMSQTN SAILMQR 

PSHKARVLAE AMSQTN SAILMQR 
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PGHKARVLAE AMSQVH HTNIMMQK 

PGHKARVLAE AMSQVQ HANIMMQR 

PSHKARVLAE AMSHAQ HANIMMQR 

PSHKARVLAE AMSQAQ HANIMMQR 

PSHKARVLAE AMSQAQ HANIMMQR 

PSHKARVLAE AMSHAQ QANIMMQR 

PSHKARVLAE AMSHAQ HATIMMQR 

PSHKARVLAE AMSHVQ QANIMMQR 

PGHKARVLAE AMSQVQ QSNIMMQR 

PGHKARVLAE AMSQVQ QSNIMMQR 

PGHKARVLAE AMSQVQSP NILMQR 

PGHKARVLAE AMSQVQ QANVMMQR 

PSHKARVLAE AMSQVQ QPNIMMQR 

PGHKARVLAE AMSQAQ QSNIMMQR 

PGHKARVLAE AMSQVQN ANIMMQK 

PGHKARVLAE AMSQVQN ANIMMQK 

PSHKARVLAE AMSQASN. . . . AAAAIMMQK 
PAHKARVLAE AMSQASN. . . .AAAAIMMQK 
PSHKARVLAE AMSQASS . . . .AAAAIMMQK 
PSHKARVLAE AMSQATN. . . . SAATAMMQR 
PSHKARVLAE AMSQATG. . . . SPAVMMQ.R 

PGHKARVLAE AMSQAS VGAIMMQ.K 

PGHKARVLAE AMSQASGT. . . LTAIMMQ . K 
PGHKARVLAE AMSQASGT. . . VAAAIMMQK 
PGHKARVLAE AMSQVSGA . . . TAAIMMQ . K 

PSHKARVLAE AMSQVQQ TNVMMQR 

PSHKARVLAE AMSQAQH. . / ...TNIMMQR 
PSHKARVLAE AMSQATS. . . . ASAAIMMQK 
PSHKARVLAE AMSQATN... . SVTTAMMQR 
PGHKARVLAE AMSQVTG. . . . SATAVMMQR 
PSHKARVLAE AMSQATN. . . . ANTAIMMQR 

PSHKARVLAE AMSQAN SAIMMQK 

PSHKARVLAE AMSQATN TAIMMQK 

PGHKARILAE AMSQAN TTIMMQK 

PGHKARVLAE AMSQATN AAIMMQK 
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B_GAJDYI NDPSSQ 

B__GB_CAM1_ NDPSSQ 
B__GB_GB8_A NDPSSQ 
B_GB_MANC__ NDPSSQ 
B__KR_WK_AF NDPSSQ 
B_NL_32 02A NDPSSQ 
B_TW_TWCYS NDPSSQ 
B_US_BC_L 0 NDPS SQ 
B_US_DH123 NDP... 
B__US_JRCSF NDPSSQ 
B__US_MNCG_ NDPLSQ 
B__US_P896_ NDPSSQ 
B_US_RF_M1 NDPSSQ 
B_US_SF2_K NDPSSQ 
B__US_WEAU1 NDPSSQ 
B_US_WR2 7_ NDPSSQ 
B_US_YU2_M SDPSSQ 
BF1_BR_93B NDPSSQ 
C_BR_92BR0 SDPLST 
C_BW_96BW0 SDPLSQ 
C_BW_96BW1 SDPLSQ 
C__BW__96BW1 NDPLSQ 
C_BW_96BW1 SDPLSQ 
C_ET_ETH2 2 NDHLLQ 
C_IN_93IN1 SDLLSQ 
C_IN_9 3 IN9 SDPLSQ 
C_IN_93IN9 SDPLSQ 
C_IN_94IN1 SDPLSQ 
CJEN_95IN2 SDPLSQ 
CRFO 1_AE_C NDPLSQ 
CRFO 1_AE_C NDPLSQ 
CRF01_AE_C NDPLSQ 
CRF 01__AE JT NDPSSQ 
CRF01_AE_T NDPLSQ 
CRF01_AE_T NDPLSQ 
CRF01_AE_T NDPLSQ 
CRF 0 1_AE_T NDPLSQ 
CRF01_AEJT NDPLSQ 
CRF02_AG_F NDP... 
CRF02_AG_F NDP. . . 
CRF02_AG_G NDP. . . 
CRF02_AG_N NDP. . . 
CRF02_AG_S NDP. . . 
CRF02_AG_S NDPYSQ 
CRF03_AB_R DDPLSQ 
CRF03_AB_R NDPLSQ 
CRF04_cpx_ SDPLSQ 
CRF04_cpx_ NHPLSQ 
CRF04_cpx_ SDPLSR 
CRF05JDFJB NDPLSQ 
CRF 0 5_DF__B NDPLSQ 
CRF06_cpx_ SDP... 
CRF06_cpx_ NDP. . . 
CRF06_cpx_ NDP. . . 
CRF06_cpx_ SDP... 
CRFll_cpx_ SDP. . . 
CRFll^cpx^ SDPLSQ 
D_CD_84ZR0 NDPLSQ 
D_CD_ELI_K NDPLSQ 
D _CD_NDK_M NDPSSQ 
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D_UG_94UG1 

F1_BE_VI85 

F1_BR_93BR 

F1_FI_FIN9 

F1_FR_MP41 

F2_CM__MP25 

F2KUJBE_VI 

G_BE_DRCBL 

G_NG_92NG0 

G_jSE_SE616 

H_BE_VI991 

H_BE_VI997 

H_CF_90CF0 

J__SE_SE702 

J_SE_JSE7 88 

K__CD_EQTB1 

K_CM_MP535 

N_CM__YBF3 0 

O__CM_ANT70 

0_CM_MVP51 

0__SN__99SE_ 

0_SN_99SE_ 

U CD 83C 



NDPLSQ 
NDP. . . 
NDP. . . 
NDP. . . 
SDP . . . 
SDQ . . . 
SDPLLQ 
NDQ . . . 
SDP . . . 
SDP . . . 
NDQ. . . 
NDPLSQ 
SDPLLQ 
SDPLSQ 
SDPLSQ 
SDPLSQ 
NDPLSQ 
NDPSSQ 
. TDQ . . . 
TDQ. . . 
TDQ. . . 
TDQ . . . 
SDPSLQ 
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Table 12. HIV Env Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga 1.1 



Name : 


00BW0762_1 


Len: 


962 


Check : 


4645 


Weight 


: 


1.00 


Name: 


00BW0768_2 


Len: 


962 


Check : 


9565 


Weight 


: 


1.00 


Name : 


00BW0874_2 


Len: 


962 


Check : 


7745 


Weight 


: 


1.00 


Name : 


00BW1471_2 


Len: 


962 


Check : 


9593 


Weight 


: 


1.00 


Name: 


00BW1616_2 


Len: 


962 


Check: 


792* 


Weight: 




1.00 


Name: 


00BW1686_8 


Len: 


962 


Check : 


3744 


Weight 


: 


1.00 


Name : 


00BW1759_3 


Len: 


962 


Check: 


9808 


Weight 


: 


1.00 


Name : 


00BW1773_2 


Len: 


962 


Check : 


3500 


Weight 


: 


1.00 


Name : 


00BW1783_5 


Len: 


962 


Check: 


9684 


Weight 


: 


1.00 


Name : 


00BW17 95_6 


Len: 


962 


Check: 


8410 


Weight 


• 


1.00 


Name : 


00BW1811 3 


Len: 


962 


Check : 


2068 


Weight 




1.00 


Name : 


00BW1859_5 


Len: 


962 


Check: 


5692 


Weight 


• 


1.00 


Name : 


00BW1880_2 


Len: 


962 


Check: 


1901 


Weight 


; 


1.00 


Name : 


00BW1921JL 


Len: 


962 


Check : 


5923 


Weight 


; 


1.00 


Name .- 


00BW2 036_1 


Len: 


962 


Check : 


7035 


Weight 


; 


1.00 


Name: 


00BW2O63_6 


Len: 


962 


Check: 


4853 


Weight 


: 


1.00 


Name : 


00BW2087_2 


Len: 


962 


Check : 


2085 


Weight 


; 


1.00 


Name: 


00BW2127__2 


Len: 


962 


Check: 


4015 


Weight 


: 


1.00 


Name: 


00BW2128_3 


Len: 


962 


Check : 


5884 


Weight 


: 


1.00 


Name: 


00BW2276_7 


Len : 


962 


Check: 


8913 


Weight 


: 


1.00 


Name: 


00BW3 819_3 


Len: 


962 


Check : 


9390 


Weight 


: 


1.00 


Name : 


00BW3 842_8 


Len: 


962 


Check: 


8867 


Weight 


: 


1.00 


. Name : 


00BW3 871_3 


Len: 


962 


Check: 


7069 


Weight 


: 


1.00 


Name : 


O0BW3 876_9 


Len: 


962 


Check: 


4761 


Weight 


: 


1.00 


Name : 


O0BW3 886_8 


Len: 


962 


Check: 


7681 


Weight 


: 


1.00 


Name: 


O0BW3 891_6 


Len: 


962 


Check: 


379 


Weight : 




1.00 


Name : 


00BW3970_2 


Len: 


962 


Check: 


8001 


Weight 


: 


1.00 


Name: 


OOBW5031_1 


Len: 


962 


Check: 


4902 


Weight 


: 


1.00 


Name : 


96BW01B21 


Len: 


962 


Check : 


5774 


Weight 


: 


1.00 


Name : 


96BW0407 


Len : 


962 


Check: 


4260 


Weight 


: 


1.00 


Name ; 


96BW0502 


Len: 


962 


Check: 


4658 


Weight 


: 


1.00 


Name: 


96BW06_J4 


Len: 


962 


Check : 


9749 


Weight 




1.00 


Name: 


96BW11_06 


Len: 


962 


Check: 


4328 


Weight 




1.00 


Name: 


96BW1210 


Len : 


962 


Check: 


3855 


Weight 




1.00 


Name: 


96BW15B03 


Len: 


962 


Check: 


9133 


Weight 




1.00 


Name : 


96BW16_26 


Len: 


962 


Check: 


5 Weight : 


1 


.00 


Name: 


96BW17A09 


Len: 


962 


Check : 


6458 


Weight 




1.00 


Name : 


96BWM01_5 


Len: 


962 


Check: 


9487 


Weight 




1.00 


Name: 


96BWM03_2 


Len: 


962 


Check : 


8766 


Weight 




1.00 


Name : 


98BWMC12_2 


Len: 


962 


Check : 


2722 


Weight 




1.00 


Name: 


98BWMC13_4 


Len: 


962 


Check : 


2526 


Weight 




1.00 


Name : 


98BWMC14_a 


Len: 


962 


Check : 


7761 


Weight 




1.00 


Name : 


98BWM014_1 


Len: 


962 


Check: 


93 Weight: 




1.00 


Name : 


98BWM018_d 


Len: 


962 


Check: 


279 


Weight : 




1.00 


Name : 


98BWM036_a 


Len: 


962 


Check : 


134 


Weight : 




1.00 


Name : 


98BWM037_d 


Len: 


962 


Check : 


9669 


Weight . 




1.00 


Name : 




Len : 


962 


Check: 


3527 


Weight : 




1.00 


Name : 


99BW4642_4 


Len: 


962 


Check: 


1175 


Weight : 




1.00 


Name : 


99BW4745_8 


Len: 


962 


Check :- 


8117 


Weight : 




1.00 


Name : 


99BW4754_7 


Len: 


962 


Check : 


5709 


Weight : 




1.00 


Name : 


99BWMC16_8 


Len: 


962 


Check : 


285 


Weight : 




1.00 


Name : 


A2_CD_97CD 


Len: 


962 


Check : 


2892 


Weight : 




1.00 


Name: 


A2_CY_94CY 


Len: 


962 


Check : 


8628 


Weight : 




1.00 


Name : 


A2D 97KR 


Len: 


962 


Check : 


471 


Weight : 




1.00 


Name : 


A2G__CD_97C 


Len: 


962 


Check : 


939 


Weight : 




1.00 


Name : 


AJBY_97BL0 


Len: 


962 


Check: 


4291 


Weight : 




1.00 


Name : 


A_KE_Q23_A 


Len: 


962 


Check : 


1190 


Weight : 




1.00 
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Name 


: A_SE_SE659 


Len 


Name 


: A_SE_SE725 


Len 


Name 


: A__SE_SE753 


Len 


Name 


: A_SE_SE853 


Len 


Name 


: A_SE_SE889 


Len 


Name 


: AJ3E_UGSE8 


Len 


Name 


: A__UG_92UG0 


Len 


Name 


: A_UG_U455 


Len 


Name 


: AC_INJ2130 


Len 


Name 


: AC_RW_92RW 


Len 


Name 


: AC_SE_SE94 


Len 


Name 


: ACD_SE_SE8 


Len 


Name 


: ACG_BE_VI1 


Len : 


Name 


: AD_SE_SE69 


Len : 


Name 


: AD_SE_SE71 


Len : 


Name 


: ADHK_NO_97 


Len : 


Name 


: ADK_CD_MAL 


Len : 


Name 


: AG_BE_VI11 


Len : 


Name 


: AGJKTG 92NG 


Len : 


Name 


: AGHU_GA__VI 


Len : 


Name 


: AGU_CD_Z32 


Len : 


Name 


: AJ_BW__BW2 1 


Len : 


Name 


: B_AU_VH_AF 


Len : 


Name 


: B_CN_RL42_ 


Len : 


Name 


: B_DE_D31_U 


Len : 


Name 


: B_DE_HAN_U 


Len : 


Name 


: B_FR_HXB2_ 


Len: 


Name 


: B_GA_OYI__ 


Len : 


Name 


: B_GB_CAM1_ 


Len: 


Name 


: B_GB_GB8_C 


Len: 


Name 


: B_GB_MANC__ 


Len : 


Name 


: B_KR_WK_AF 


Len : 


Name 


s B_NL_3202A 


Len : 


Name 


: B_TW_TWCYS 


Len : 


Name 


BJQS_BC_L0 


Len : 


Name : 


BJCFS_DH123 


Len : 


Name : 


B_US_JRCSF 


Len : 


Name : 


B_US__MNCG_ 


Len : 


Name : 


B_US_P896_ 


Len: 


Name : 


B_US_RF_M1 


Len : 


Name : 


B_US_SF2_K 


Len : 


Name : 


B_US_WEAU1 


Len: 


Name : 


B__US__WR27__ 


Len : 


Name : 


B_US_YU2_M 


Len: 


Name : 


BF1_BR_93B 


Len : 


Name : 


C_BR_92BR0 


Len : 


Name : 


C_BW_96BW0 


Len : 


Name : 


C_BW_96BW1 


Len: 


Name : 


C_BW_96BW1 


Len : 


Name : 


C_BW_96BW1 


Len: 


Name : 


C_ET ETH22 


Len : 


Name : 


C_IN_93IN1 


Len : 


Name : 


C_IN_93IN9 


Len: 


Name : 


CJCN_93IN9 


Len: 


Name : 


C_IN_94IN1 


Len: 


Name : 


C_IN_95IN2 


Len: 


Name: 


CRF01_AE_C 


Len: 


Name: 


CRF01_AE_C 


Len: 


Name : 


CRF01_AE_C 


Len: 


Name : 


CRF01_AE__T 


Len: 


Name : 


CRF01_AE_T 


Len: 



962 


Check: 


6674 


Weight : 


i nn 
X ■ uu 


962 


Check : 


4925 


Weight : 


J.. Uu 


962 


Check: 


2482 


Weight : 


t nn 

-L • UU 


962 


Check: 


1860 


Weight : 


JL . U U 


962 


Check: 


2102 


Weight : 


t nn 


962 


Check : 


5063 


Weight : 


i nn 

X • UU 


962 


Check: 


6685 


Weiaht - 


x ■ u u 


962 


Check : 


8657 


Weight : 


1 on 

X • u u 


962 


Check : 


7784 


Weight : 


x ■ uu 


962 


Check : 


4676 


Weight : 


T nn 
x . u u 


962 


Check : 


2949 


Weight : 


i fin 
x . u u 


962 


Check : 


1464 


Weight : 


i nn 
x . u u 


962 


Check : 


2980 


Weight : 


i fin 
x . u u 


962 


Check: 


8959 


Weight : 


t n n 
x • u u 


962 


Check : 


7056 


Weight : 


t n a 

X . UU 


962 


Check: 


487 


We "i crh t- • 


t n a 
x . u u 


962 


Check : 


2555 


Weight : 


t n n 
x . u u 


962 


Check : 


6342 


Weight : 


1 A A 

x . uu 


962 


Check : 


1272 


Weight : 


T A A 
X . UU 


962 


Check : 


1974 


Weiaht - 


T A A 
X . UU 


962 


Check : 


4356 


Weiaht • 


T A A 
X . UU 


962 


Check: 


9995 


Weiaht • 


1 A A 
X . U U 


962 


Check : 


5833 


Weight : 


1 fi A 
X » U U 


962 


Check: 


4092 


Weight : 


t nn 

X • UU 


962 


Check : 


5486 


Weight : 


1 n n 
x . u u 


962 


Check: 


3480 


Weight : 


1 nn 

X . u u 


962 


Check: 


6939 


Weight : 


1 on 

X . u u 


962 


Check: 


9780 


Weight : 


T n n 
x • u u 


962 


Check: 


9716 


Weight : 


i n n 
x . u u 


962 


Check : 


4180 


Weight : 


i n n 
x • u u 


962 


Check: 


9762 


Weight : 


T nn 

X • U U 


962 


Check: 


6641 


Weiaht • 


i n n 
x . u u 


962 


Check : 


7168 


Weight : 


i n n 
x . u u 


962 


Check: 


3591 


Weight : 


i n n 
X • u u 


962 


Check: 


7266 


Weight : 


t n n 
x . u u 


962 


Check: 


6905 


Weight : 


i n n 
X . u u 


962 


Check: 


9381 


Weiaht- • 


t n n 
x . u u 


962 


Check : 


9951 


Weight : 


t n n 
x . u u 


962 


Check : 


5855 


Weiaht - 


t n n 
x . u u 


962 


Check : 


6075 


Weiaht- • 


T A A 

1. uu 


962 


Check : 


1434 


Weiaht • 


t n n 
x • u u 


962 


Check: 


5451 


Weiaht • 


i n n 
x . u u 


962 


Check: 


4262 


Weight : 


i n n 
x • u u 


962 


Check : 


5841 


Weight : 


T no 

X • u u 


962 


Check : 


5506 


Weiaht * 


1 nn 
x . u u 


962 


Check : 


8769 


Weight : 


1 nn 

X . u u 


962 


Check : 


6197 


Weight : 


i nn 


962 


Check : 


8144 


Weight : 


i no 


962 


Check: 


1160 


Weight : 


i no 

X . u u 


962 


Check : 


2736 


Weight : 


1.00 


962 


Check : 


8219 


Weight : 


1.00 


962 


Check : 


4068 


Weight : 


1.00 


962 


Check : 


3674 


Weight : 


1 00 


962 


Check: 


1581 


Weight : 


1.00 


962 


Check: 


9352 


Weight : 


1.00 


962 


Check : 


6988 


Weight: 


1.00 


962 


Check : 


8684 


Weight: 


1.00 


962 


Check: 


3342 


Weight : 


1.00 


962 


Check: 


5017 


Weight : 


1.00 


962 


Check: 


9124 


Weight : 


1.00 


962 


Check : 


2718 


Weight : 


1. 00 
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CRF01_AE__T 

CRF01_AEJT 

CRF01_AE__T 

CRF01_AE__T 

CRF02_AG_F 

CRF02_AG_F 

CRF02_AG_G 

CRF02_AG_N 

CRF02_AG_S 

CRF02__AG_S 

CRF03_AB_R 

CRF03_AB_R 

CRF04_cpx_ 

CRF04_cpx_ 

CRF04_cpx__ 

CRF05JDF_B 

CRF05__DF_B 

CRF06_cpx_ 

CRF06_cpx_ 

CRF06_cpx_ 

CRF06_cpx_ 

CRFll_cpx_ 

CRFll_cpx_ 

D_CD_84ZR0 

D_CD_ELI_K 

D_CD_NDKJM 

D_UG_94UG1 

F1__BE_VI85 

F1_BR_93BR 

F1_FI_FIN9 

F1__FR_MP41 

F2_CM_MP25 

F2KUJBE_VI 

G_BE_DRCBL 

G_NG_92NG0 

G_SE_SE616 

H_BE_VI991 

H_BE__VI997 

H_CF_90CF0 

JJSE_SE702 

J_SE_SE788 

K_CD_EQTB1 

K_CM__MP535 

N_CM__YBF3 0 

Q_CM_ANT7 0 

0_CM_MVP5 1 

Q_SN_99SE_ 

0__SN_99SE_ 

U CD 83C 



Len: 

Len: 

Len : 

Len: 

Len: 

Len : 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len : 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len : 

Len: 

Len: 

Len : 

Len: 

Len: 

Len : 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 



962 


Check : 


2104 


Weight : 


1 . 00 


962 


Check : 


8495 


Weight : 


1 . 00 


962 


Check : 


4076 


Weight : 


1 . 00 


962 


Check : 


94 8 


Weight : 


1 - 00 


962 


Check: 


9298 


Weight : 


1 . 00 


962 


Check : 


9278 


Weight : 


1 . 00 


962 


Check: 


4373 


Weight : 


1 . 00 


962 


Check : 


8955 


Weight : 


1 . 00 


962 


Check: 


252 


Weight : 


1 . 00 


962 


Check : 


5147 


Weight : 


1 . 00 


962 


Check: 


2239 


Weight : 


1 . 00 


962 


Check : 


2671 


Weight : 


1 . 00 


962 


Check : 


4892 


Weight : 


1 . 00 


962 


Check: 


8070 


Weight : 


1 . 00 


962 


Check: 


5453 


Weight : 


1 . 00 


962 


Check : 


174 


Weight : 


1 . 00 


962 


Check : 


2694 


Weight : 


1.00 


962 


Check: 


7351 


Weight : 


1 . 00 


962 


Check: 


5073 


Weight : 


1 . 00 


962 


Check : 


661 


Weight : 


1 . 00 


962 


Check : 


8440 


Weight : 


1 . 00 


962 


Check: 


2217 


Weight : 


1 . 00 


962 


Check : 


8216 


Weight : 


1 . 00 


962 


Check : 


4843 


Weight : 


1 . 00 


962 


Check: 


8403 


Weight : 


1.00 


962 


Check : 


5813 


Weight : 


1 . 00 


962 


Check : 


94 07 


Weight : 


1 . 00 


962 


Check: 


2982 


Weight : 


1 . 00 


962 


Check: 


8919 


Weight : 


1 . 00 


962 


Check : 


6761 


Weight : 


1 . 00 


962 


Check: 


478 


Weight : 


1 . 00 


962 


Check: 


9292 


Weight : 


1 . 00 


962 


Check: 


567 


Weight : 


1 . 00 


962 


Check : 


6261 


Weight : 


1 . 00 


962 


Check : 


4508 


Weight : 


1 . 00 


962 


Check : 


6733 


Weight: 


1 . 00 


962 


Check : 


7498 


Weight : 


1 . 00 


962 


Check : 


8345 


Weight : 


1 . 00 


962 


Check : 


2490 


Weight : 


1 . 00 


962 


Check: 


4446 


Weight : 


1 . 00 


962 


Check: 


1662 


Weight : 


1 . 00 


962 


Check : 


7406 


Weight : 


1.00 


962 


Check: 


512 


Weight : 


1.00 


962 


Check : 


1733 


Weight: 


1.00 


962 


Check: 


75 Weight : 


1.00 


962 


Check : 


3290 


Weight : 


1.00 


962 


Check : 


6963 


Weight : 


1.00 


962 


Check: 


6278 


Weight : 


1.00 


962 


Check: 


9044 


Weight : 


1.00 



00BW0762, 


_1 


.... MRVMGI 


MRNC . 


QQWWI 


WV.ILGFWML 


00BW0768] 


J2 


. . . .MRVREI 


LRNC. 


QQWWT 


WG. 


SLGFWMV 


00BW0874] 


~2 


. . . .MRAMGT 


QRNC . 


RQWWI 


WG. 


ILGFWML 


00BW14 71] 


~2 


. . . . MRVMGI 


LRSC. 


QQWWI 


WG. 


ILGFWML 


00BW1616] 


~2 


. . . .MRVMGI 


QRNC. 


QRWWI 


WG. 


ILGFWMI 


O0BW1686~ 


~8 


. . . . MRVKGI 


QRNW. 


PQWWI 


WG. 


SLGFWML 


00BW1759~ 


~3 


. . . . MRVRGI 


PRNW. 


QQWWI 


WG. 


ILGFCMI 


00BW1773^ 


J2 


. . . . MRVREI 


LRSY. 


QHWWM 


WS. 
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VQCTHGIKPV VSTQLLLNGS LSEE.GIVIR SENITNNAKT IIVHLNESVE* 
VQCTHGIKPV VPTQLLLNGS LAEE.EIIIR SEALTNNAKT IIARLNKSVE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENLAKNDKT IIVHLNESVE 
VLCTHGIKPV VSTQLLLNGS LSEG.EIMIR SENLTDNAKT IIVQLNKPVE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIVIR SENITNNAKI IIVHLNETVD 
VQCTHGIKPV VSTQLLLNGS LAEG.EIIIR SENMTDNVKP IIVHLNESVE 
VQCTHEIKPV VSTQLLLNGS LAEE.QIIIR SENMTNNAKT IIVHLKKPVQ 
VQCTHGIKPV VSTQLLLNGS LAEK.EIIIR SENLTNNAKI IIVHLKDPVN 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENLTNNVKT IWHLNESIQ 
VQCTHGIKPV VSTQLLLNGS LAEG.EIIIR SKNLTDNART IIVHLNESVQ 
VQCTHGIKPV VSTQLLLNGS LAEE.DIIIR SENLTDNVKT IIVHLNESVE 
VQCPHGIKPV VSTQLLLNGS LAEE.EIIIR SENIEDNVKT IIVHLNESIE 
VQCTHGIKPV VATQLLLNGS LAEE.EIVIR FKNITNNAKI IIVQLNTSVG 
VQCTHGIKPV VSTQLLLNGS LAEE.GIIIR SENLTDNAKT ITVQLDQAVE 
VQCTHGIRPV VSTQLLLNGS LAEG.EIIIR S ENLADNAKT IIVHFNESVE 
VQCTHGIKPV VSTQLLLNGS LAEK.EIIIR SKNITDNVKT IIVHLNEAVE 
™™ V VSTQLLLNGS LAEE.EIIIR SENLTNNAKT IIVHLNDSVE 
VQCTHGIKPL VSTQLLLNGS LAQQ . QIMIT SENLTNNAKI I I VHLKEAIN 
VQCTHGIKPV VSTQLLLNGS LAEE.IIIR. SENLTNNAKT IIVQLKEPVK 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENLTNNVKT IIVHLNKSVE 
VQCTHGIKPV VSTQLLLNGG LAEK.EIIIR SENLTNNVKT IIVHLNESVE 
^THGIKPV VSTQLLLNGS LAEG.EIIIR SKNLSDNAKI IIVHLNESVG 
VQCTHGIKPV VSTQLLLNGS IAEG.EIIIR SENLTNNAKT IIVHLNESVK 
IQCTHGIKPV VSTQLLLNGS LAEK.DIIIR SENLTNNIKT IIVHLNDSVQ 
VQCTHGIKPV VSTQLLLNGS LAEG.EIIIR SENLTNNAKT IIVHLNESIE 
VQCTHGIKPV VS TQLLSNGS LSEKlEIIIR SENLTNNVKT TIVHLNESVE 
IQCTHGIKPV VSTQLLLNGS RAEK.EIIIR SENMTNNAKT IIVHLNESIE 
VQCTHGIKPV VSTQLLLNGS LA.E.EIVIR SENI SDNAKT IIVHLNESVE 
VQCTHGIKPV VSTQLLLNGS LAEE.DIIIR SENLTDNVKT IIVHLNESIE 
^™ GIKPV VSTQLLLNGS LAEE.EIIIR SENLTDNAKI IIVHLNESVR 
VQCAHGIKPV VSTQLLLNGS VAKG.EIIIR SENLTNNAKI IIVQLNKPVK 
^™™ V VSTQLLLNGS LAEE.KIIIR SGNVTNNAKT IIVHLNESVE 
VQCTHGIKPV VSTQLLLNGS LAEK.EIIIR FKNITNNAKT IIVHLNESVE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENITDNVKT IIVHLNKSVE 
^THGIKPV VSTQLLLNGS LAEG.EIIIR SKNLSNNAYT IIVHLNDSVE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENLTNNAKT LIVHLNESVE 
VQCTHGIKPV VSTRLLLNGS LAEE.EIIIR SENLTNNAKI IMVQLNES IE 
VQCTHGIKPV VSTQLLLNGS LAEK.EIIIR SKNITDNVKT IIVHLNESVE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENLTDNAKT IIVHFNESVQ 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SENLTNNVKT IIVHLNESVE 
VQCTHGIKPV VSTQLLLNGS LAEK.EIVIR SENLTDNVKT IIVHLNQSVQ 
VQCTHGIKPV VSTQLLLNGS LAEE.EWIR SENLTNNAKT IIVQLKEPVK 
VQCTHGIKPV VSTQLLLNGS LAEK.EIIIR SENITNNIKT IIVHLNEPVE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIIIR SKNITNNANT IIVQLKDPVD 
IQCTHGIKPV VSTQLLLNGS IAEE.EIIIR SENLTDNVKT IIVHLKDPID 
VQCTHGIKPV VSTQLLLNGS IAEE.EIIIR SENLTNHPRT IIVQLNQSIE 
VQCTHGIKPV VSTQLLLNGS LAEE.EIVIR SENLTNNVKT IIVHLNESIE 
VQCTHGIKPV VSTQLLLNGS LAEG.EIIVR SENLTDNVKT IIVHLDKPVG 
VQCTHGIKPV VSTQLLLNGS LAEK.EIIVR SENLTNNAKI IIVQLKQSVG 
^CTHGIKPV VSTQLLLNGS RAEK.EWIR AESLTDNAKV IIVHLKDSVQ 
VQCTHGIKPV VSTQLLLNGS LAEE.DIIIR SKNLTDNAKT IIVHLNESVN 
VQCAHGIRPV ASTQLLLNGS LAEG.KVMIR SENITDNAKN IIVQFNKPVP 
VQCTHGIKPV ASTQLLLNGS LAEGGKIMIR SENITNNAKN IIVQPTKPVL 
VQCTHGIRPV ASTQLLLNGS LAEGNKTIIR SANITDNTKN IIVQFTKPVO 
^ C ^ IRPV VSTQLLLNGS LAEE.EVMIR SENITDNAKN IIVQFDKPVE 
VQCTHXIKPV VSTQLLLNGS LAEKX.VMIR SENITDNVKI IIVQLTEPVN 
^™~ IKPV VSTQLLLNGS LAEKN.ITIR SENITNNAKI IIVQLVQPVT 
VQCTHGIKPV VSTQLLLNGS LAKGG.IRIR SENITNNVKT ILVQLDKPVN 
VQCTHGIKPV VSTQLLLNGS LAEEK.IMIR SENISDNAKT IIVQLTEPVT 
VQCTHGIKPV VSTQLLLNGS LATK. . IMIR SENITNNAKT IIVQLVEPVE 
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550 

QIINMWQGVG KAMYAPPIAG NIICKSNITG LLLTRDGGEE N. TTE 
QIINMWQEVG. RAMYAPP I EG NITCKSNITG LLLVRDGGKT ED NKSE 
QIINLWQEVG RAIYAPPIAG NITCKSNITG LLLTRD . GG . NNS TTE 
QIINMWQGVG QAMYAPPIAG NITCRSNITG LLLTRDGGIN . . .EDDNNTE 
QIINLWQGVG RAMYAPP IAG NITCKSNITG LLLTRD GGGE N NSTE 
QIINMWQEVG RAIYAPPIAG KITCISNITG TLLTRDGGVS NTTe!.'.GNE 
QIINMWQEVG RAMYAPPIEG NITCNSSITG LLLTRDGGKN S. TNNGTE 
QIINMWQKVG RAMYAPPIAG NITCKSNITG LLLTRDGGNT S. STEE 
QIINMWQGVG QAIYAPPIAG NITCKSNITG LLLTRDGG. . NN TENTE 
QIINMWQKVG RAMYAPPIEG NITCISNITG LLLTRDGG.. YE ANHTE 
QIINLWQEVG RAMYAPPIAG NITCKSNITG LLLTRD GGGS NTTN ATE 
QIINMWQEVG RAMYAPPIAG NITCKSKITG LLLTRDGGKQ . . ' *NESK 
QIINMWQGVG RAMYAPPIEG NITCNSNITG LLLTRNRGRE NGD NTTE 
QIINMWQGVG RAIYAPPIEG NITCKSNITG LLLTRDGGKG NDT AE 
QIINMWQKVG RGIYAPPIEG SITCNSNITG LLLVRDGG. . IN TSTVE 
QIINMWQGVG RAMYAPPIAG NITCTSNITG LILTRDGGG. NE "tNETE 
QIINMWQEVG RAMYAPPIAG NITCKSNITG ILLTRDGGED TKN KTE 
QIVNMWQGVG RAIYAPPIAG NITCNSSITG LLLLRDGGTE TENN *RTE 
QIINLWQEVG RAMYAPPIEG NITCKSNITG LLLTRDGGTN N NNTE 
QIINMWQGVG RAIYASPIEG SITCKSNITG LLLVHDGG. . NSNT*' STE 
QIINMWQEVG RAIYAPPIAG NITCTSNITG LLLTRDGEPS TE 
QVINMWQRVG QAIYAPPIEG IITCNSSITG LLLVRDGD . . NQ TSDTE 
QIINMWQEVG RAIYAPPIRG IITCTSNITG LLLTRDGGNT GGN TTE 
QIINMWQEVG RAMYAPPIAG NITCTSNITG LLLTRDGG. N GG. . NNTE 
QFIRMWQRVG QAMYAPPIAG NITCRSNITG LLLTRDG. . . KNDTE 
QIINMWQGVG RAMYAPPIAG RIICKSNITG LELVRDGGQD N. . . VMNATE 
QIINMWQKVG RAIYAPPIAG KITCKSNITG LLLVRDGGGG NN TATE 
QIINMWQGVG RAMYAPPIAG NIICKSNITG VLLTYDGGEE N E 
QIINMWQGVG RAMYAPP I KG SITCRSNITG LLLTRD GGLN RS * TEEPE 
QIINMWQGVG RAIYAPPIAG NITCVSNITG LLLTWDGGHQ SN E 
QIINMWQKVG RAMYAPPIAG NLTCESD ITG LLLTRDGGKT G PNDTE 
QIINMWQEVG RAIYAPPIAG NITCKSNITG LLLTRDGGLN NDS E 
QPINLWQEVG RAMYAPPIAG NIICKSNITG LLLTRDG. . . .D. . . KNDSE 
QIINRWQEVG RAMFAPP IAG NITCKSNITG ILLVRDGGNT SEN IE 
QIINMWQKVG RAIYAPPIEG NITCSSSITG LLLARDGG. . LD. 'nVTTE 
QIINMWQGVG RAMYAPPIEG NITCKSNITG LLLVRDGGTE ENN TGTE 
QIINMWQGXG QAMYVPPIAG NITCRSNITG LLLTRDGGK . . . VTGNTTE 
QIINMWQGVG RAM YAS PI AG NITCKSNITG LLLTRDGG.. NE TSGIE 
QIINTWQEVG RAIYAPPIAG NIICISNITG LLLTRDGGKT ND TNDTE 
QIINMWQEVG RAMYAPPIAG NITCRSNITG LLLTRD. GGN TTE TKE 
QIINMWQGVG RAMYAPPIAG NITCISNITG LILTRDGG. . VN *RSDTE 
QIINMWQEVG RAIYAPPIKG NITCESNITG LLLTRDGGSN DTT E 
QIINMWQGVG QAMYAPPIAG NITCKSNITG ILLTRDGGIN NTN "<3TE 
QIINMWQKVG RAIYAPPIAG NITCSSRITG LLLTRDGGKN . DTHE 
QIINMWQEVG RAMYAPPIAG NITCKSNITG LLLVRDGGNN NTT E 
QIINMWQKVG RAMYANPIEG NITCRSNITG LLLENDG. . . N M 
QIINMWQKVG RAMYAPPIAG NITCKSNITG LLLVRDGGTA TD E 
QIINMWQEVG RAMYAPPIAG NITCQSNITG LLLTRDGGTE TD. ! NKTE 
QIINMWQEVG RAMYAPPIEG NITCKSNITG LLLVRDGGGK N ATNDTE 
QIINMWQEVG RAMYAPPIAG RIICNSTITG LILTRDGGNT N NTE 
QIINRWQEVG RAMYAPPIAG NITCTSNITG LLLVRDGGRT SD. "sTKE 
QIINMWQRVG RAMYAPPIAG VIKCTSNITG MILTRDG. .G KNS INE 
QIINMWQRVG RAMYAPPIAG IIKCTSNITG IILTRDG. .G NNG TNE 
QIVNMWQRVG RAMYAPPING TIKCTSNITG MILTRDGNSG GNA TNE 
QIINMWQRVG RAMYAPPIAG IINCTSNITG IILTRDGEKG GDN TIE 
QIINMWQRVG QAMYAXPIKX SIRCESNITG LLLTRDGXGX TNX SNE 
QIINMWQRAG QAMYAPPIPG VIKCESNITG LLLTRDGGKD NN. . VNE 
QIINMWQRAG KAMYAPPIQG VIRCESNITG LILTRDG.GD AG. . ENE 
QIINMWQRAG QAIYAPPIPG IIRCESNITG LLLTRDG. GV VNS TNE 
QIINMWQRVG QAMYAPPIRG AIRCKSNITG LLLTRDGGNS NSS. TNE 
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RAWRAVLHIP 
RAWRAILHIP 
RAFRAI LHI P 
RAFRAILHIP 
RAFRALLHIP 
RAFRALLHIP 
RIGRGILHIP 
RIGTGIRNIP 
RIGQGFLHIP 
RIGRGILNIP 
RIGRGILNIP 
RICRAILNIP 
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00BW0762_1 
00BW0768_2 
00BW0874_2 
00BW1471_2 
00BW1616_2 
00BW1686_8 
00BW1759_3 
00BW1773_2 
00BW1783_5 
00BW1795_6 
00BW1811_3 
00BW1859_5 
OOBW18 80_2 
O0BW1921_l 
0 0BW2036_1 
00BW2063_6 
0 0BW2 087_2 
00BW2127__2 
00BW2128_3 
00BW2276_7 
00BW3 819__3 
00BW3 842_8 
00BW3871_3 
00BW3 876_9 
00BW3 886__8 
00BW3 891_6 
0OBW397O_2 
00BW5031_1 
96BW01B21 
96BW0407 
96BW0502 
96BW06_J4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_26 
96BW17A09 
96BWM01_5 
9 6B WM03_2 
98BWMC12_2 
98BWMC13_4 
98BWMC14_a 
98BWM014_1 
98BWMO!8_d 
98BWM036_a 
98BWM037_d 
99BW3932_1 
99BW4642_4 
99BW4745_8 
99BW4 754_7 
99BWMC16 — 8 
A2_CD_97CD 
A2_CY_94CY 

A2D 97KR 

A2G_CD_97C 
A_BY_97BL0 
A_KE_Q23_A 
A_SE_SE659 
A_SE__SE725 
A_SE_SE753 



951 962 
RRIRQGFEAA LQ 
RRIRQGPEAA LQ 
RRIRQGFEAA LQ 
RRIEQTFEPP LL 
RRIRQGVEAA LQ 
RRIRQGFETA LL 
RRIRQGFEAA LQ 
RRIRQGFEAA LQ 
RRIRQGFEAA LQ 
TRIRQGFEAA LQ 
RRIRQGFEAS LL 
RRIRQGFEAA LL 
TRIRQGFEAA LQ 
TRIRQGFEAA LQ 
RRIRQGFEAA LQ 
RRVRQGFETA LL 
RRIRQGFEVA LL 
TRIRQGFEAA LL 
SSIRQGFEAA LQ 
RRIRQGFEAA LL 
TRIRQGFEAA LL 
RRIRQGFEAA LQ 
RRLRQGFEAA LL 
RRIRQGFEAA LL 
RRIRQGFEAA LL 
TRIRQGFEAA LQ 
RRIRQGFEAS LL 
RRIRQGFEAA LQ 
RRIRQGFEAA LQ 
TRIRQGFEAA LQ 
RRIRQGFEAA LQ 
RRIRQGFEAA LL 
RRIRQGFETA LL 
RRIRQGFEAA LQ 
RRVRQGFEAA LQ 
RRLRQGFEAA LQ 
TRIRQGLEAA LQ 
RRIRQGFEAA LL 
RRIRQGFEAA LL 

GFEAA LQ 

RRIRQGFETA LL 
RRVRQGFEAA LQ 
TRIRQGLEAA LL 
RRIRQGFEAA LQ 
TRIRQGFEAA LL 
RRIRQGFEAA LL 
RRIRQGFETA LL 
RRIRQGFEAA LQ 
TRIRQGFEAA LQ 
RRIRQGFEAA LQ 
RRIRQGFETA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRXGAEKA LQ 
VRIRQGLERA LL 
RRIRQGFERA LL 
RRIRQGFEEA LL 
RRIRQGFERA LL 
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A_SE_SE853 

A_SE_SE889 

AJ3E_UGSE8 

A_UG_92UG0 

A_UG_U455_ 

AC_IN_213 0 

AC_RW_92RW 

AC_SE_SE94 

ACD_SE_J3E8 

ACG_BE_VI1 

AD_SE_SE69 

AD_SE_SE71 

ADHK_NO_97 

ADK_CD_MAL 

AG_BE_VI11 

AG_NG_92NG 

AGHU_GA_VI 

AGU_CD_Z32 

AJ_BW_BW2 1 

B_AU_VH_AF 

B_CN_RL42_ 

B_DE_D31_U 

B_DE_HAN__U 

B_FR_HXB2_ 

B_GA_OYI 

B__GB__CAM1__ 

B_GBj3B8jC* 

BJ3B_MANC_ 

B__KR_WK__AF 

B_NL_3202A 

BJTWJTWCYS 

B_US_BC_L0 

B_US_DH123 

B_US_JRCSF 

B_US_MNCG_ 

BJUS_P896__ 

B_US_RF_M1 

B_US_SF2_K 

B_US_WEAU1 

B_US_WR27_ 

B_US_YU2_M 

BF1_BR_93B 

C_BR_92BR0 

C_BW_96BW0 

C_BW_96BW1 

C_BW_96BW1 

C_BW_96BW1 

C_ET_ETH2 2 

C_IN_93IN1 

C_IN_93IN9 

C_IN_93IN9 

C_IN_94IN1 

C_IN_95IN2 

CRF01_AB_C 

CRF01_AE_C 

CRF01_AE_C 

CRF01_AE_T 

CRF01_AE__T 

CRF01_AE_T 

CRF01_AE_T 

CRF01 AE T 



VRIRQGFERA LL 
RRSKQGLKRA LQ 
RRIRQGFER. 
RRIRQGFERA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
SRIRQGFEAA LQ 
RRIRQGFERA LL 
RRIRQGLERA LL 
RRIRQGFERA LL 
ARIRQGLERV LL 
RRIRQGLERA LL 
RRIRQGFERX LL 
RRIRQGFERA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERA LI 
RRIRQGLERA LL 
VRIRQGFERA LL 
RRIRQGLERL LL 
TRIRQGLERA LL 
VRIRQGLERA LL 
RRVRQGLERA LL 
RRIRQGLERI LL 
RRIRQGLERA LL 
RRIRQGLERL LL 
TRIRQGLERA LQ 
VRIRQGLERA LL 
RRIRQGLERA LL 
VRIRQGLERA LL 
TRIRQGLERA LL 
RRIRQGLERL LL 
TRIRQGLERA LL 
TRIRQGLERA LL 
TRIRQGLERA LL 
TRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERL LL 



RRIRQGLERV LL 
VRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGFEAA LQ 
TRIRQGFEAA LQ 
RRIRQGFETA LL 
RRIRQGFEAA LQ 
RRVRQGFEAA LQ 
RRIRQGLEAA LQ 
TRIRQGFEAA LQ 
RRIRQGFEAV LQ 
TRIRQGFEIA LQ 
RRIRQGLEAA LQ 
RRIRQGFEAA LQ 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERT LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERA LL 
RRIRQGLERT LL 
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CRF01_AEJT RRIRQGLERA LL 
CRF02_AG_F RRIRQGLERA LL 
CRF02_AGJF VRIRQGLERA LL 
CRF02_AG_G RRIRQGFERA LL 
CRF02_AGJff RRIRQGFERA LL 
CRF02_AG_S RRIRQGFERA LL 
CRF02_AG_S RRIRQGLERA LQ 
CRF03_AB_R RRIRQGAEKA LQ 
CRF03_AB_R RRIRQGAEKA LQ 
CRF04_cpx_ RRIRQGLERA LL 
CRF04_cpx__ RRIRQGFEKA LL 
CRF04_cpx_ RRIRQGLERA LL 
CRF05_DF_B RRIRQGLERA LL 
CRF05_DF_B RRIRQGLERA LL 
CRF06_cpx_ RRIRQGFERA LL 
CRF06_cpx_ TRIRQGFERA LL 
CRF06_cpx_ RRIRQGAERA LI 
CRF06_cpx_ RRIRQGFERA LL 
CRFll_cpx_ RRIRQGLERA LL 
CRFll_cpx_ RRIRQGFERA LL 
D__CD_84ZR0 TR I RQGLERA LL 
D_CD_ELI_K RRIRQGLERS LL 
D_CD_NDK_M RRIRQGLERL LL 
DJJG_94UG1 VRIRQGLERA LL 
F1_BE_VI85 RRIRQGAERA LL 
F1_BR_93BR RRIRQGLERA LL 
F1_FI_FIN9 RRIRQRVERA LI 
F1_FR_MP41 RRIRQGLERS LL 
F2_CM_MP25 RRIRQGLERA LL 
F2KU_BE_VI RRIRQGFERA LL 
G_BE_DRCBL RRIRQGLERA LL 
G_NG_92NG0 TRIRQGLERA LL 
G_SEJSE616 TRIRQGLERA LL 
H_BE_VT991 RRIRQGFERA LL 
H_BE_VI997 RRIRQGLERI LL 
H_CF_90CFO RRIRQGFERS LL 
JJSE_SE702 RRIRQGLERA LL 
J_SE_SE788 RRIRQGLERA LL 
K_CDJEQTB1 RRIRQGFERL LL 
K_CM_MP535 RRIRQGLERA LL 
N_CM_YBF30 RRIRQGLERA LI 
O_CM_ANT70 RRIRQGLERS LL 
0_CM__MVP51 RRIRQGAERI LV 
OjSN_99SE_ RRIRQGLERS LL 
0_SN_99SE_ RRIRQGLERA LL 
U_C D 83C RRIRQGFERA LL 
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Table 13. HIV Nef Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga l . 1 



til a. 1 1 iv-* - 


00BW0762 1 


Lezi i 


232 


V-X1C O JV . 


14 fil 

J l i O 1 


Mp-i rrnt- « 
wcxyxit . 




. uu 




00BW0768~ 2 


Tien * 


2 ^2 

£t J £t 


Phpflf • 


J D J U 


Wo-i rrht- • 

wtr xyxiL- . 


i 


. UU 


Name : 


0 0BW0874~~2 


Leu * 


2^2 

£k O A 


PTippV • 
v_xxtrv- j\. • 


14 PI 
j *± O 3 


nclyllL . 


1 


. UU 




00BW1471 2 


XJGXX • 


2*^2 

ZL. -J ^> 


V_ll t- 1_ JV . 


Q4 Q1 

i?fr jX 


Ta7 a *i V» +** • 

weiyxiL : 


1 


. uu 


X\J Cllllv_. . 


00BW1616 2 


T.pTI « 

Xj c 11 • 


2^9 


V-llt- l_ XV . 


XD Ufr 


TXT A *i i^r"V\ ♦« . 

i/vciynt : 


1 


. uu 


XNdlllG • 


\J \J XJ M i U O V O 


XJCiX . 


9 "^2 


f'Vi onlr • 
V-11CL»JV . 


1 1 Q n 


wexync : 


1 


. U U 


KTsmp - 

1 V Ct 1 1 1 C . 


0 1 

U V/DVVX / -J _7 «j 


T-ieri : 


£, J 4 


Lucca. : 




weigiiL : 


1 


. UU 


IMCtlllC . 


00BW177'*i 2 


ucu . 




V_llC- A. ; 




weiyHL : 


1 . 


UU 


XNCLIUG . 




XjcXX . 


9 *^ 9 
^. o Z 




DUD J 


weignt : 


1 
X 


. u u 


Msmp ■ 
IncHUC • 


U UJDIfVx / -/J O 


Xjen : 


9 9 


Check : 


71 no 


weignt : 


1 


. 00 


IMclUIC . 


nn'RWi ri i i 

UUDWIOll J 


Lexi 


9 19 
/L 3 £ 


LXlcCK : 


A A £Zf\ 

44bLI 


Weight : 


1 


. 00 


in a. me . 




Lsn 


9 1 9 


LiiecK : 


y lib 


weignt : 


1 


. 00 


IN allies . 


UUDVllOOU ^ 


Len 


919 

^ O A 


Check : 


A 1 1*1 0 
4JUZ 


weignt : 


1 


.00 


KTamo • 


U VJ J3 VV J. _7 x x 


Leu 


9 19 


Vi T/- « 


OT3*7 


weignt : 


1 


. 00 


XNctluC . 




Len 


9 19 


Check : 


/I CCQ 

ft DO O 


Weight * 


1 


.00 


"NT a m • 
IN clitic • 






9 19 




1 Aon 


weignt 


1 


- uu 


iMcime . 


UUDWZUO / Z* 


Len 


919 


^ ^ £-«v T/- . 


*7 C 1 O 

/biz 


weignt : 


1 


. 00 


IN d 1 1 It: ■ 


fi 0BW2 127 2 


Len 


9 19 
» A o Z. 


^iieciv . 


J4ZD 


Weight 


1 


. uu 






T.pri 

XJC1X 


919 


fVi o V" • 


CI OC 


wexgnt 


1 


. uu 


Name • 


00BW22 76 7 


XJCli 


9 "^2 


\^11CL<J\. . 


1^91 
OuZ J 


wexgiit 


1 


. u U 


in a. i uc • 


0 0RW3 819 3 


UCJ.1 


919 
• a 3 ^ 




QQ1 

-7 27 J 


weignt : 


1 . 


U U 


INCllllC • 


0O"RW^R4.2 ft 


T.on 
xj ell 


» 919 


V_llCi Js. . 




weignt 


1. 


. UU 


inch tier . 


OORW^fl71 1 

UUDr« JO /x O 




919 


/"•Tua r"« lr • 

ixecjv . 


1 


weignt 


1 

1 


. U U 


XMCLlllC* . 




T.pn 

XJCXX 


9^9 


Vw-il t. JV . 


1 Qt:i 


wexgnt 


: 1 


. UU 


Name : 


0OBW3 886 8 


ucu 


919 
» <i o i 




j / OD 


wexgnt 


; i 


. U U 


Name : 


0 0BW3 8 91 6 

U VDTV J O v_> 


T.pn 

XJC1X 


919 


r** n Ci/-»lr - 
v-llCJ v^iV . 


jo 3D 


wexgnt 


1 


. U U 


IN cl lilt: • 


nnRW^Q7n 9 


Len 


» 919 


Check ; 


Q Q1 1 


Weight 


: 1 


.00 


in ct i lie . 




Len 


• 919 




9 991 
Z> Z\ Z\ z> 


weignt 


: 1 


. UU 


IMalllC . 


1X3 Z X 


Len 


9 19 
£ 5 z, 


Lxiecjs. z 


Zi/O 


TaT A A /»rVi 4> 

weignt 


: 1 


. uu 


Name : 




Len 


» 9 19 


u.necK : 


jZbl 


weignt 


; 1 


. 00 


IN Cl 1 1 It. . 


QfiRwnmi9 


Len 


9 19 
. Z. 3 z. 


Check : 


in 


Weight : 


1 . 


A A 

uu 




^DDVUUD U *± 


Len 


, 9 19 
: Z\ 3 Z 


Check : 


o / 


TaTa -i /-tVi V 

weignt 


: 1 


. 00 


IN CUllC . 


Q^RMI T A *T 

jDDWIx uo 


Len 


* 919 


Check : 




TaT A ■* /<rVi +- 

weignt 


1 


. 00 


XNcllllG . 


9 fiRWI 91 n 

-7 DXOV* X ^ XV/ 


Len 


» 9 19 




olio 


weignt 


: 1 


A A 

. uu 


XNCllllG • 


9fiRWi RRfl^ 

^ODWX 3DU J 


Len 


» 9 19 
. ZJZ 


LilcCJ\ Z 




weignt 


: 1 


A A 

. uu 


xmglillc: • 


9fi"RW1 9fi 


T.pn 
XJ Gil 


912 
• Z 3 z 




3957 


Weight 


1 


. 00 


Name : 




Len 


i 232 


wlCLJV ■ 


1945 


Weight 


: 1 


.00 


Name * 


96BWMOT R 




9 "3 9 

• Zi 3 z. 




5827 


Weight 


1 


.00 


Name : 


96BWM03 2 


Len 


: 232 




2303 


Weight 


: 1 


.00 


Name • 


98BWMC12 2 


Len 


: 232 


^iici^Jv • 


2423 


Weight 


: 1 


.00 


Name : 


98BWMC13 4 


Len 


: 232 


CTier'lc • 

VtfllClpf A, . 


4043 


Weight 


1 


.00 


Name : 


98BWMC14 a 


Len 


. 232 




3568 


Weight 


i 1 


.00 


Name : 


98BWM014~~1 


Len 


j 232 


r'her'lc • 

J. J. c: V— - xv. m 


4909 


Weight 


: 1 


.00 


Name : 


98BWM018 d 


Len 


: 232 


Check • 


3505 


Weight 


1 


.00 


Name : 


98BWM036 a 


Len 


. 232 


V*JULC ^ J\. . 


6393 


Weight 


: 1 


.00 


Name : 


98BWM037~d 


Len 


: 232 


Check: 


1912 


Weight 


: 1 


.00 


Name : 


99BW3932_1 


Len 


: 232 


Check: 


19 Weight : 


1.00 


Name : 


99BW4642_4 


Len 


: 232 


Check: 


6848 


Weight 


1 


.00 


Name: 


99BW4745_8 


Len 


: 232 


Check: 


93 8 


Weight : 


1. 


00 


Name : 


99BW4754_7 


Len 


: 232 


Check : 


1379 


Weight 


1 


.00 


Name : 


99BWMC16_8 


Len 


t 232 


Check: 


4222 


Weight 


: 1 


.00 


Name : 


A2_CD_97CD 


Len 


: 232 


Check : 


2359 


Weight 


1 


.00 


Name : 


A2_CY_94CY 


Len 


: 232 


Check : 


5163 


Weight 


1 


.00 


Name : 


A2D 97KR 


Len 


: 232 


Check : 


9468 


Weight 


1 


.00 


Name : 


A2G_CD_97C 


Len 


: 232 


Check : 


4189 


Weight 


1 


.00 


Name : 


A_BY_97BL0 


Len 


: 232 


Check : 


2590 


Weight 


1 


.00 


Name : 


A_KE_Q23 


Len 


: 232 


Check : 


2652 


Weight 


1 


.00 
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Name : 


A_SE_SE659 


Len 


: 232 


Name : 


A_SE_SE725 


Len 


232 


Name : 


A_SE_SE753 


Len 


: 232 


Name : 


A_SE_SE853 


Len 


232 


Name ; 


A_SE_SE889 


Len 


: 232 


Name : 


A_SE_UGSE8 


Len 


: 232 


Name : 


A_UG_92UG0 


Len 


; 232 


Name : 


A_UG_U455 


Len 


: 232 


Name : 


AC_IN__213 0 


Len 


: 232 


Name : 


AC_RW_92RW 


Len 


: 232 


Name : 


AC_SE_SE94 


Len 


: 232 


Name : 


ACD_SE_SE8 


Len 


: 232 


Name : 


ACG_BE_VI1 


Len 


232 


Name : 


AD SE SE69 


Len 


: 232 


Name : 


AD_SE SE71 


Len 


: 232 


Name : 


ADHK NO 97 


Len 


: 232 


Name : 


ADK CD MAL 


Len 


: 232 


Name : 


AG_BE VI 11 


Len 


: 232 


Name : 


AG_NG_92NG 


Len 


: 232 


Name : 


AGHU GA VI 


Len 


: 232 


Name : 


AGU CD Z32 


Len 


i 232 


Name : 


AJ_jBW_BW2 1 


Len 


: 232 


Name : 


B_AU_VH 


Len 


: 232 


Name : 


B_CN_RL42 


Len 


: 232 


Name : 


B_DE_D31 


Len 


: 232 


Name : 


B_DEJKAN 


Len 


: 232 


Name : 


B_FR_HXB2 


Len 


: 232 


Name : 


B_GA_OYI 


Len 


r 232 


Name : 


B_GB_CAM1 


Len 


: 232 


Name : 


B_GB — GB8 


Len 


: 232 


Name : 


B_GB_MANC 


Len 


i 232 


Name : 


B_KR_WK 


Len 


: 232 


Name : 


B_NL_3202A 


Len 


: 232 


Name : 


B_TW_TWCYS 


Len 


: 232 


Name : 


B_US_BC 


Len 


: 232 


Name : 


B_US DH123 


Len 


: 232 


Name : 


B_US_JRCSF 


Len 


; 232 


Name : 


B_US_MNCG 


Len 


: 232 


Name : 


B_US_P896 


Len 


: 232 


Name : 


B US RF 


Len 


: 232 


Name : 


B_US_SF2 


Len 


: 232 


Name : 


B_US_WEAU1 


Len 


: 232 


Name : 


B_US_WR27 


Len 


: 232 


Name : 


B_US_YU2 


Len 


: 232 


Name : 


BF1_BR_93B 


Len 


: 232 


Name : 


C_BR_92BR0 


Len 


: 232 


Name : 


C_BW_96BW0 


Len 


» 232 


Name : 


C_BW_96BW1 


Len 


: 232 


Name : 


C_BW_96BW1 


Len 


: 232 


Name : 


C_BW_96BW1 


Len 


: 232 


Name : 


C_ET_ETH2 2 


Len 


: 232 


Name: 


C_IN_93IN1 


Len 


; 232 


Name : 


C_IN_93IN9 


Len 


: 232 


Name : 


C_IN_93IN9 


Len 


232 


Name: 


C_IN 94 INI 


Len 


232 


Name: 


C_IN_95IN2 


Len 


232 


Name : 


CRF01_AE_C 


Len« 


232 


Name : 


CRF01_AE_C 


Len 


232 


Name : 


CRF01_AE_C 


Len: 


232 


Name : 


CRF01_AE_T 


Len: 


232 


Name : 


CRF01_AE_T 


Len, 


232 
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Check 


: 9245 


Weight 


1 . 00 


Check 


: 985 


Weight : 


1. 00 


Check 


: 1638 


Weight 


1 . 00 


Check 


: 2503 


Weight 


1.00 


Check 


: 2327 


Weight 


1.00 


Check 


: 9538 


Weight 


1 . 00 


Check 


: 2621 


Weight 


1 . 00 


Check 


: 2084 


Weight 


1.00 


Check 


: 2406 


Weight 


1 . 00 


Check 


3441 


Weight 


: 1.00 


Check 


: 3488 


Weight 


: 1. 00 


Check 


: 3016 


Weight 


: 1.00 


Check 


: 5006 


Weight 


: 1.00 


Check 


: 3362 


Weight 


: 1 . 00 


Check 


: 2262 


Weight 


1.00 


Check 


: 8765 


Weight 


1.00 


Check 


: 6397 


Weight 


: 1.00 


Check 


: 6471 


Weight 


1 . 00 


Check 


t 2880 


Weight 


: 1.00 


Check 


: 9053 


Weight 


1.00 


Check 


: 523 


Weight : 


1 . 00 


Check 


: 3842 


Weight 


1.00 


Check 


: 8468 


Weight 


1.00 


Check 


: 9366 


Weight 


; 1.00 


Check 


: 3989 


Weight 


1.00 


Check 


: 563 


Weight : 


1.00 


Check 


: 3184 


Weight 


1. 00 


Check 


: 5511 


Weight 


1 . 00 


Check 


: 4779 


Weight 


: 1 . 00 


Check 


: 1128 


Weight 


: 1.00 


Check 


: 2885 


Weight 


: 1.00 


Check 


: 9915 


Weight 


: 1 . 00 


Check 


: 3135 


Weight 


i 1. 00 


Check 


: 2211 


Weight 


: 1 . 00 


Check 


: 3145 


Weight 


: 1. 00 


Check 


: 7019 


Weight 


1.00 


Check 


: 4099 


Weight 


1.00 


Check 


4137 


Weight 


: 1.00 


Check 


. 4405 


Weight 


: 1.00 


Check 


: 450 


Weight : 


1. 00 


Check 


: 5413 


Weight 


: 1.00 


Check 


5335 


Weight 


: 1. 00 


Check 


s 3720 


Weight 


: 1.00 


Check 


9943 


Weight 


1 . 00 


Check 


3598 


Weight 


1 . 00 


Check 


3908 


Weight 


1. 00 


Check 


3880 


Weight 


1. 00 


Check 


4542 


Weight : 


1. 00 


Check 


6118 


Weight : 


1. 00 


Check : 


5089 


Weight : 


1.00 


Check: 


744 


Weight : 


1.00 


Check 


943 


Weight : 


1.00 


Check : 


1241 


" Weight : 


1.00 


Check: 
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PICQHGMEDE 
PICQHGMEDP 
PMCQQGMEDP 
PINQHGMEDP 
PMS QHGMEDE 
PMS QHGMEDE 
PMS QHGMEDE 
PMSQHGMDDE 
PMSLHGMEDD 
PICQHGIDDT 
PICQHGMEDE 
PICQHGMEDE 
PICQHGMEDE 
PMCQHGIEDP 
PICQHGMEDE 
PMSLHGMEDD 
PACQHGIEDE 
PICQHGIEDE 
PVNQHGMEDE 
PINQHGMEDE 
PICQHGADDD 
PACAHGFEDT 
PACNHGAEDA 
PACNHGFEDN 
PVCNHGFEDP 
PICQHGMDDE 



00BW0762 
OOBW0768^ 
00BW0874^ 
00BW1471 
00BW1616* 



201 

_1 HREVLMWKFD 
_2 EREVLRWKFD 
2 DREVLKWQFD 
2 DREVLKWVFD 
2 DREVLRWKFD 



SSLARRHVAR E 
SSLARRHMAR E 
SSLVRRHMAR E 
SSLARRHMAR E 
SSLARRHLAR E 



232 

. LHPEYYKD C. 
. LHPEYYKD C. 
.LHPEYYKD C. 
.LHPEYYKD C. 
►LHPEYYKD C. 
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00BW1686_8 
00BW1759_3 
00BW1773_2 
00BW1783_5 
00BW1795_6 
00BW1811_3 
00BW1859_5 
00BW1880_2 
00BW1921_1 
00BW2036_1 
00BW2063_6 
00BW2087_2 
00BW2127_2 
00BW212 8_3 
00BW2276J7 
00BW3 819_3 
00BW3 842_8 
00BW3871_3 
0 0BW3 876_9 
00BW3886_8 
00BW3 891_6 
00BW3970_2 
00BW5031_1 
96BW01B21 
96BW0407 
96BW0502 
96BW06_J4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_26 
96BW17A09 
96BWMO!_5 
9 6B WM03_2 
98BWMC12_2 
98BWMC13_4 
98BWMC14_a 
98BWM014_1 
98BWM018_d 
98BWM036_a 
98BWM037_d 
99BW3932_1 
99BW4642_4 
99BW4745_8 
99BW4754_7 
99BWMC16_8 
A2_CD_9 7 CD 
A2_CY_94CY 

A2D 97KR 

A2G_CD_97C 
A_BY_97BL0 
A_KE_Q23 
A_SE_SE659 
AJSEJSE725 
A_SE_SE753 
A_SE_SE853 
A_SE_SE889 
A_SE_UGSE8 
A_UG_92UG0 
A_UG_U455 
AC IN 2130 



DREVLMWKFD 
HGEVLMWKFD 
HGEVLKWKFD 
DKEVLQWKFD 
DREVLMWKFD 
HGEVLEWKFD 
EREVLRWKFD 
HKEVLRWKFD 
DREVLMWKFD 
HKEVLTWKFD 
HREVLQWKFD 
DREVLKWVFD 
EREVLQWKFD 
HGEVLMWKFD 
EREVLKWKFD 
EGEVLQWKFD 
DREVPMWKFD 
DREVLMWKFD 
DREVLRWKFD 
EREVLKWKFD 
DREVLRWKFD 
DREVLKWQFD 
DREVLRWKFD 
HREVLKWKFD 
DREVLRWKFD 
HGEVLKWKFD 
EREVLTWKFD 
HKEVLKWKFD 
DREVLKWKFD 
DREVLMWKFD 
ERGVLKWKFD 
DREVLKWVFD 
HREVLKWKFD 
HGEVLMWKFD 
EREVLKWQFD 
GKEVLIWKFD 
DREVLMWKFD 
EKEVLKWVFD 
DREVLRWKFD 
DREVLKWEFD 
DREVLKWQFD 
DREVLKWKFD 
DREVLIWKYD 
DREVLRWKFD 
HKEVLKWKFD 
DREVLKWQFD 
EREVLKWKFD 
ERE VLRWE FD 
EREVLKWVFD 
DKQVLGWRFD 
EKEVLMWKFD 
EREVLKWKFD 
EKEVLKWKFD 
EKETLRWRFD 
EREVLKWKFD 
ERETLMWKFD 
ERETLMWKFD 
ERETLMWKFD 
EKETLRWKFD 
EKEVLMWKFD 
YGEVLQWKFD 



SHLAYRHMAR E.LHPEYYKD C 
SALARRHMAR E.LHPEYYKD C 
SSLARRHLAR E.KHPEFYKD C 
SSLARRHMAR E.LHPEYYKD C 
SHLARRHTAR E . LHPEFYKD C 
SMLARRHMAR E.LHPEYYKD C 
SQLARRHMAR E.LHPEYYKD 
SSLARRHLAR E.LHPEYYKD 
SQLAHRHLAR E.KHPEWYKD 
SHLARRHMAR E.LHPEYYKD 
SQLARRHIAR E.RHPEYYKD 
SSLARRHLAR E.KHPEFYKD 
SLLSRRHLAR IXIHAEYYKD 
SHLAYRHMAR E.KHPEFYKD 
SSLARRHMAR E . LHPEYYKA 
SLLAYRHMAR E.QHPEYYKD 
SLLAHRHMAR E.LHPGYYKD 
SHLVHRHMAR E . LHPEYYKG 
S.LARRHIAR E . LHPEYYKN 
SQLAYRHMAR E . IHPEYYKD 
SHLARRHMAR E . LHPEWYKD 
ISLARRHMAR E. LHPEWYKD 
SELARRHIAR E.RHPEFYKD 



C 
C 

c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 
c 



c 
c 
c 
c 
c 
c 
c 
c 
c. 
c. 
c. 
c. 
c. 



SQLARRHMAR E.LHPEYYKD 
SSLAHRHMAR E.LHPEYYKD 
SQLARRHMAR E.LYPEYYKD 
SHLVHRPMAR E . IHPEYYKD 
SQLARRHLAR E. LHPEFYKD 
SSLARRHLTR E . KHPEYYKD 
SHLARRHMAR E.LHPEYYKD 
SHLARRHMAR E.LHPEYYKD 
SHLARKHMAR E. LHPEYYKN 
SSLARRHMAR E . LHPEFYKD 
SQLARRHMAR E.LHPEYYKD 
SSLARRHMAR E.LHPEYYKD 
SHLARRHMAR E . LHPEFYKD C. 
SQLARRHIAR E . IHPEYYKD C. 
SSLARRHVAR E. LHPEFYKD C. 
SSLARRHMAR E.LHPEYYKD C. 
IRLAHTHMAR Q.LHPEFYKN C- 
SSLAHRHVAR E.LHPEYYKD C." 
SMLARRHMAR E. LHPEFYKD C. 
SQLARRHMAR E . LHPDYYKD C. 
SHLARRHMAR E. LHPEFYKD c! 
SHLARRHMAR E. LHPEFYKD C. 
SSLARRHMAR E.LHPEYYKD c! 
SRLALRHLAR E.QHPEFYKD C. 
RSLARRHRAR E.LHPEYYKD C. 
SHLALVHKAR E. LHPEFYKD C. 
SSLARRHLAR E. KHPEYYKD C. 
SRLALKHRAR E. LHPEFYKD c! 
SRLALKHRAR E. LHPEWYKD C. 
SRLALKHLAC E.KHPEFYKD C. 
SRLALRHRAQ E . MHPEFYKD C. 
SRLALKHRAQ E. LHPEFYKD C. 
SKLALKHRAH E.LHPEYFKN C. 
SRLALTHRAR E . LHPEFYKD C. 
PHLAFKHRAF E . LHPEYYKN 
SSLARVHKAR E. LHPEFYKD C. 
STLALKHRAY E. LHPEFYKD 
SHLAYKHQAR E.RHPEFYKD c! 
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AC_RW_92RW 
AC_SE_JSE94 
ACD_SE_SE8 
ACG_BE_VI1 
AD_SE_SE69 
AD_SE_SE71 
ADHK_NO_97 
ADK__CD_MAL 
AG_BE_VT11 
AG_NG_9 2NG 
AGHU_GA_VI 
AGU_CD_Z32 
AJ_BW_BW2 1 
B_AU_VH 
B_CN_RL42 
BJDE_D31 
B_DE_HAN 
B_FR__HXB2 
B_GA_OYI 
B_GB_CAM1 
B_GB_GB8 
BJ3B__MANC 
B_KR_WK 
B_NL_32 02A 
B_TW_TWCYS 
B_US_BC 
B_US_DH123 
B_US_JRCSF 
B_US_MNCG 
B_US_P896 
B_US_RF 
B_US_SF2 
B_US_WEAU1 
B__US_WR27 
B_JJS_YU2 
BF1_ BR_93B 
CJBR_92BR0 
C_BW_96BW0 
C_BW_96BW1 
C__BW_96BW1 
C_BW_96BW1 
C_ETJETH22 
C_IN_93IN1 
C_IN_93IN9 
C__IN_93IN9 
C_IN_94IN1 
C_IN_95IN2 
CRF01_AE_C 
CRF01_AE_C 
CRF01_AE_C 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE__T 
CRF02_AG_F 
CRF02_AG_F 
CRF02_AG_G 
CRF02_AG_N 
CRF02 AG S 



DREVLKWKFD 
ERETLVWRFD 
DKEVLRWKFD 
DREVLVWRFD 
EREVLMWRFN 
EKEVLKWQFD 
EXEVLMWRFD 
EREVLKWKFD 
EREVLVWKFD 
DREVLVWRFD 
EREVLMWKFD 
EREVIiMWKFD 
DREVLMWKFD 
EKEVLMWKFD 
EREVLMWKFD 
EREVLVWRFD 
EREVLKWKFD 
EREVLEWRFD 
EKEVLVWKFD 
EKEVLMWKFD 
EKEVLVWKFN 
EKEVLVWKFD 
EGEVLVWRFD 
EREVLEWRFD 
EKEVLVWRFD 
EREVLEWRFD 
EKEVLLWKFD 
EKEVLVWKFD 
EREVLVWKSD 
ERQVLVWRFD 
EKEVLVWKFD 
EKEVLVWRFD 
EKEVLMWKFD 
EKEVLVWKFD 
EREGLEWRFD 
DREILQWRFD 
HREVLQWKFD 
DGEVLRWKFD 
HKEVLKWKFD 
DREVLKWKFD 
DKEVLMWKFD 
DREVLKWKFD 
HREVLKWKFD 
HREVLQWKFD 
HREVLQWKFD 
HREVLMWK. . 
HNEVLVWKFD 
EREVLMWKFD 
EREVLMWKFD 
EREVLMWKFD 
EREVLMWKFD 
EREVLMWKFD 
EREVLIWKFD 
EREVLMWKFD 
EREVLMWKFD 
EREVLIWKFD 
DREVLVWRFD 
DREVLVWRFD 
DREVLVWRFD 
DREVL I WRFD 
DREVLVWRFD 



SHLAHRHMAR 
SRLALKHLAR 
SQLARRHMAR 
SRLALKHIAK 
SRLAFEHKAH 
SRLALKHLAR 
SRLAFKHRAR 
SSLALRHRAR 
SMLAFKHRAR 
SSLARRHIAR 
SSLAREHVAR 
SSLARKHLAR 
SSLARRHLAR 
SRLAVHHMAR 
SRLAIHHMAR 
SRLAFKHMAR 
SHLAFHHKAR 
SRLAFHHVAR 
SRLAFRHMAR 
SRLAFHHMAR 
SRLAFHHMAR 
SRLAFHHVPD 
SRLAFHHMAR 
SRLAFHHMAR 
STLAFHHRAR 
SRLAFHHMAR 
SRLAYHHMAR 
SKLALHHVAR 
SHLAFQHYAR 
SRLAFHHVAR 
SRLAFHHVAR 
SRLAFHHMAR 
SKLAFHHVAR 
SRLAFHHKAR 
SRLAFHHVAR 
SRLAFHHMAR 
SLLARRHMAR 
SHLAHRHMAR 
SQLARRHLAR 
SSLARRHLTR 
SHLARRHMAR 
SHLARRHMAR 
SQLARRHMAR 
SLLAHRHRAR 
SHLAHRHMAR 
. QLAHRHIAR 
SQLAHKHRAR 
SSLARRHIAR 
SSLARRHIAR 
SSLARRHIAR 
SALARKHTAR 
STLARKHIAR 
SALARRHIAR 
SALARKHIAR 
SALARKHVAR 
SSLARKHLAR 
SSLARTHRAR 
SSLARRHIAR 
SSLAFTHRAR 
SRLAFRHTAR 
SRLAFTHKAR 



E . LHPEYYKD C. 
E. KHPEFYKD C. 
E.MHPEYYKD C. 
E . KHPEYFKD C. 
Q. LHPEYYKD C. 
E. KHPEFYKD C. 
E. LHPEFYKD C. 
E.QHPEYYKD C. 
E. LHPEYYKD C\ 
E.QHPEYYKD C. 
K.LYPEFFKD C. 
E . MHPEFYKD . . 
E . KHPEFYKD C. 
E . LHPEYYKN . . 
E.MHPEYHKD C. 
E. LHPEYYKN . . 
E . LHPEYYKD 
E . LHPEYFKN 
E.VHPEYYKD 
E . KHPEFYKD 
E . LHPEFYKD 
E. LHPEYYKD 
E . KHPEYYKD 
E . LHPEYYKD 
E . LHPEYYKX 
E . LHPEYYKN 
E. LHPEYYKN 
E . LHPEYYKD 
E . LHPEYYKN 
E . LHPEYFKN 
E . KHPEYYKD 
E . LHPEYYKD 
E . LHPEYFKD 
E . LHPEYYKN 
E . LHPEYYKN 
E . LHPEYYKD 
E . LHPEYYKD 
E . LHPEYYKD 
E . LHPEFYKD 
E . KHPEYYKD 
E . LHPEYYKD 
E . LHPEYYKD 
E . LHPEFYKD 
E . LHPEFYKD 
E . LHPEYYKD 
E . LHPEFYKD 
E . LHPEFYNK 
E . LRPEYYKD 
E . LHPEYYKD . . 
E. LHPEYYKD C. 
E. LHPEYYKD C. 
E . QHPEFYKD C. 
E . LRPEFYKD C. 
E.MHPEYYKD C. 
E.QHPEYYKD C. 
E. LHPEYYKD C. 
E. LHPEYYKD C. 
E . RHPEFYKD C. 
E. MHPEFYKD C. 
E. LHPEYYKD C. 
E. MHPEFYKD CX 



C. 
C. 
C. 
C. 
C. 
C. 
C. 
C. 
C. 
R. 
C. 

c. 
c. 

c. 
c. 
c. 



c. 
c. 
c. 
c. 
c. 
c. 
c. 
c. 
c. 
c. 
c. 

DC 

c. 
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CRF02_AG_S DKEVLVWRFD SRLAFRHTAR E.LHPEYYKD C 
CRF03_AB_R EKEVLMWKFD SRLALTHRAR E . LHPEFYKD C 
CRF03_AB_R EKEVLMWKFD SRLALTHRAR E. LHPEFYKD C 
CRF04_cpx_ EREVLKWKFD SRLAYKHVAR E. LHPEFYKD C 
CRF04_cpx_ EREVLKWKFD SRLAFKHIAR E. LHPEFYKD C 
CRF04_cpx_ EREVLKWKFD SLLAYRHMAR E. LHPEFYKD C 
CRF05_DF_B DREVLQWKFD SSLALRHIAR E . RHPEFYQD 
CRF05_DF_B DGEVLRWKFD SSLALKHIAR E . RRPEFYQD 
CRF06_cpx_ EREVLKWKFD SSLARRHIAR E - KHPEFYKD C 
CRF06_cpx_ EGEVLMWKFD SSLARRHIAR E . LHPDFYKD C 
CRF06_cpx__ EREVLMWKFD SSLARRHTAR E - MHPEFYKD C 
CRF06_cpx_ EXEVLMWKFD SSLARRHIAX E.XHPEFXKD C 
CRFll_cpx_ EREVLKWVFD SSLARRHIAR E . LHPDFYKD 
CRFll_cpx_ DREVLRWKFD SSLARRHIAR E . LHPDFYKD 
D_CD_84ZR0 EREVLVWRFN SRLAFEHKAK E . KYPEYFKN C 
D_CD_ELI ERQVLKWRFN SRLAFEHKAR E . MHPEFYKM 
D__CD_NDK ERQVLMWRFN SRLALEHKAR E. LHPEFYKD C 
D_JJG_94UG1 EREVLVWRFN SRLAFEHKAK M.KHPEYYKD C 
F1_BE_VI85 DREVLRWKFD SSLALRHIAR E . RHPEFYQD 
F1_BR_93BR DKEVLKWEFD SRLALRHIAR E . RHPEYYQD 
F1_FI_FIN9 DREVLRWKFD SRLALRHIAR E.RHPEFYRD 
F1_FR_MP41 DREVLRWEFD SRLAFRHIAR E.KHPEFYQN 
F2_CM_MP25 DKEVLKWQFD SRLALRHIAR E.RHPEYYKD 
F2KU_BE_VI EREVLVWKFD SRLALKHLAR E.RHPEYYKD C 
G_BE_DRCBL DGEVLVWRFD SSLARRHIAR E.LHPEYYKD C 
G_NG_92NG0 DREVLVWRFN SSLARRHLAR E.LHPEYYKD C 
G_SE_SE616 DREVLVWRFD SSLARRHIAR E.LHPEYYKD c" 
HJ3E_VI991 EREVLMWKFD SRLALRHRAK E . LHPEFYKD C 
H_BE_VI997 EGEVLMWKFD SRLAFTHTAR E. KHPEFYKD C 
H_CF_90CF0 GREVLMWRFD SRLALTHLAR V.RHPEY RD C 
J_SE_SE702 EREVLKWKFD SSLARRHIAR E. LHPEFYKD C* 
J_SE_SE78 8 EREVLQWKFD SSLARRHIAR E.LHPEFYRD C 
K_CDJ3QTB1 HREVLKWKFD SSLARKHVAR E.MHPEYYKD 
K_CM_MP535 HRE ILMWKFD SSLARRHVAR E.LHPDYYKD 
N_CM_YBF30 HKEVLVWRFD SSLARRHVAR E.LHPEFYKN C* 
O_CM_ANT70 HKE ILMWKFD RSLGNTHVAM ITHPELFQKD 
0_CM_MVP51 HGEILKWQFD RSLGLTHIAL QKHPELFPSN 
0_SN_MP129 HGQILRWQFD RSLGSTHVAM VTNPELFNRD 
O_SN_MP130 HKEMLKWQFD RSLGSTHVAL ITHPELFLKD 
U - CD 83C EKEVLMWKFD SSLARRHLAR E. LHPEFYKD C 
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Table 14. HIV Pol Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga 1.1 



jn a. me ! 


UUBWU /o2 1 


Len: 


1046 


Name : 


UUJaWO/oo 2 


Len: 


1046 


Name : 


U UdWUo I *± 2 


Len : 


1046 


Name z 


ATSTaTT /I *7 T O 

UUiJWI^ /± 2 


Len: 


1046 


Name : 


OOBWlolo 2 


Len: 


1046 


Name : 


U0BW16 8 6 8 


Len: 


1046 


Name : 


00BW1759_3 


Len: 


1046 


Name : 


Q0BW1773 2 


Len: 


1046 


Name : 


00BW1783 5 


Len : 


1046 


Name : 


0 0BW1795 6 


Len: 


1046 


Name : 


0 0BW1811 3 


Len: 


1046 


Name : 


00BW1 859__5 


Len: 


1046 


Name : 


ATlT.11 O <-» r\ f*i 

0OBW1880_2 


Len: 


1046 


Name : 


00BW1921 1 


Len: 


1046 


Name : 


0 0BW2036_1 


Len: 


1046 


Name : 


UOBW2 063 6 


Len: 


1046 


Name : 


OOBW2 0 87 2 


Len : 


1046 


Name : 


0 0BW2 12 7_2 


Len : 


1046 


JMame : 


UUBW212 8 3 


Len: 


1046 


Name : 


UUBW2276 7 


Len: 


1046 


Name : 


UUdWj o19 3 


Len: 


1046 


jxame : 


A ADMO O /I 1 O 

UUBW3o42 .8 


Len : 


1046 


JMame : 


UUBW3871 3 


Len: 


1046 


JMame : 


UOBW3876 9 


Len: 


1046 


Name : 


UUBW3 88 6 8 


Len: 


1046 


Name : 


UUBW3891 6 


Len: 


1046 


Name : 


00BW3970 2 


Len: 


1046 


Name : 


A AT) TVT C A T 1 «1 

0 0BW503 1_1 


Len: 


1046 


Name : 


96BW01B21 


Len: 


1046 


Name : 


96BW0407 


Len: 


1046 


Name : 


96BW0502 


Len: 


1046 


Name : 


96BW06 J4 


Len: 


1046 


Name : 


96BWll_06 


Len: 


1046 


Name : 


96BW12 10 


Len: 


1046 


Name : 


96BW15B03 


Len: 


1046 


Name : 


96BW16 2 6 


Len: 


1046 


Name : 


96BW17A09 


Len: 


1046 


JMame : 


ybaWMOl 5 


Len : 


1046 


lvalue : 


q cnwMA o o 
ybBWiyivJo 2 


Len: 


1046 


IMcLlllc Z 


QQTatATiwroi o o 
y ODWiYK- 12 2 


Len : 


1046 


iMcLUicr • 


QQ'DTjJMr , '1 1 A 
i7 0J3ln/i v J\».±«j 4 


Len : 


1046 


T\T.si mo • 

XMCtillC • 


27 OX3W1TIL.X4 a 


Len : 


1046 


■LVdlllG . 




Len : 


1046 




Qflnwivini o /i 


Len : 


1046 


XMclUlC. . 


7oDVVrlUJO a 


T am 

Len : 


1046 


Name : 


98BWM037~d 


Len : 




Name: 


99BW3932_1 


Len: 


1046 


Name : 


99BW4 642 4 


Len: 


1046 


Name : 


99BW4745 8 


Len: 


1046 


Name : 


99BW4754 7 


Len: 


1046 


Name : 


99BWMC16_8 


Len: 


1046 


Name : 


A2_CD_97CD 


Len: 


1046 


Name : 


A2_CY_94CY 


Len: 


1046 


Name: 


A2D 97KR 


Len: 


1046 


Name : 


A2G_CD_97C 


Len: 


1046 


Name: 


A_BY_97BL0 


Len: 


1046 


Name : 


A_KE_Q23_A 


Len: 


1046 



PC17US2004/009510 



Check : 


4376 


Weight : 


1.00 


Check : 


8430 


Weight : 


1.00 


Check: 


8925 


Weight : 


1.00 


Check : 


1324 


Weight : 


1.00 


Check : 


935 


Weight: 




1.00 


Check : 


8131 


Weight : 


1.00 


Check: 


579 


Weight : 




1.00 


Check : 


1975 


Weight : 


1.00 


Check: 


216 


Weight : 




1.00 


Check: 


5932 


Weight : 


1.00 


Check: 


6525 


Weight : 


1.00 


Check : 


2879 


Weight : 


1.00 


Check : 


7093 


Weight 




1.00 


Check : 


2524 


Weight 




1.00 


Check : 


8279 


Weight 




1.00 


Check : 


3935 


Weight 




1.00 


Check: 


7898 


Weight 




1.00 


Check : 


728 


Weight : 




1.00 


Check : 


5356 


Weight 




1.00 


Check : 


9456 


Weight 




1.00 


Check : 


6369 


Weight 


: 


1. 00 


Check : 


4573 


Weight 


: 


1.00 


Check : 


6948 


Weight 


: 


1.00 


Check : 


6609 


Weight 


: 


1.00 


Check: 


8244 


Weight 


: 


1.00 


Check : 


5718 


Weight 


: 


1.00 


Check : 


3940 


Weight 


: 


1.00 


Check : 


2442 


Weight 


: 


1. 00 


Check : 


2358 


Weight 


: 


1.00 


Check : 


8537 


Weight 


: 


1.00 


Check : 


3948 


Weight 


: 


1.00 


Check: 


7173 


Weight 


: 


1.00 


Check : 


973 


Weight : 




1.00 


Check : 


5817 


Weight 




1.00 


Check : 


5157 


Weight 




1.00 


Check: 


3303 


Weight 




1.00 


Check : 


1256 


Weight 




1.00 


Check : 


5593 


Weight 




1.00 


Check : 


3661 


Weight 




1.00 


Check : 


7159 


Weight 




1.00 


Check : 


3254 


Weight 




1.00 


Check : 


5638 


Weight 




1.00 


Check : 


7680 


Weight 




1.00 


Ciieck : 


1619 


Weight 




1.00 


CxiecK : 


8852 


Weight 




1.00 


V—Xiecj Jv ; 


4750 


Weight 




1 . 00 


Check : 


5391 


Weight 




1.00 


Check : 


1514 


Weight 




1.00 


Check : 


52 Weight : 




1.00 


Check : 


4905 


Weight 




1.00 


Check : 


1544 


Weight 




1.00 


Check : 


9703 


Weight 




1. 00 


Check : 


3235 


Weight 




1.00 


Check: 


3776 


Weight : 




1.00 


Check : 


2059 


Weight ; 




1.00 


Check : 


2724 


Weight : 




1.00 


Check: 


1835 


Weight : 




1.00 
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Name 
Name 
Name 
Name 
Name 
Name 
Name 
Name 
Name ; 
Name : 
Name : 
Name: 
Name : 
Name : 
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QMAGADCMAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAS 

QMAGADCVAD 

QMAGADCVAD 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAS 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAS 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGDDCVAG 

QMAGADCVAD 

QMAGADCVAG 

QMAGADCVAG 

QMAGDDCVAG 

QMAGADCVAG 

QMAGADCVAS 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 

QMAGADCVAG 



WKGPAKLLWK 
WKGPAKLLWK 
WKGPAKLLWK 
WKGPAKLLWK 
WKGPAKLLWK 
WKGPAKLLWK 
WKGPAKLPWK 
WKGPAKLLWK 
WKGPAKLLWK 
WKGPAKLLWK 
WKGPAQLLWK 
WKGPAQLLWK 
WKGPAQLLWK 
WKGPAQLLWK 
WKGPAKLLWK 

1046 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDEDQ 
RQDEDQ 
GQDEN . 
RQDEDQ 
RQDEDQ 
GQDEN . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED . 
RQDED . 
GQDEN. 
RQDED . 
RQDEN. 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDEN. 
RQDED . 
RQDED . 
GQDEN. 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
GQDEN. 
RQDED . 
RQDEDQ 
RQDEDQ 
RQDED . 
RQDEDQ 
RQDED . 
RQDED. 
RQDEDQ 
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98BWM036_a 
98BWM037__d 
99BW3932_1 
99BW4642_4 
99BW4745_8 
99BW4 754_7 
99BWMC16_8 
A2_CD_97CD 
A2_CY_94CY 

A2D 97KR 

A2G_CD_97C 

A_BY_97BL0 

A_KE_Q23_A 

A_SE_SE659 

A_SE_SE725 

A_SE_SE753 

A_SE_SE853 

A_SE_SE889 

A_SE_UGSE8 

A_UG_92UG0 

A_UG_U455_ 

AC_IN_2130 

AC_RW_92RW 

AC_SE_SE94 

ACD_SE_SE8 

ACG_BE_VI1 

AD_SE_SE69 

AD_SE_SE71 

ADHK_NO_97 

ADK_CD_MAL 

AG_BE_VI11 

AG_NG_92NG 

AGHUJ3A_VI 

AGU_CD_Z32 

AJ JBW_BW2 1 

B_AU_VH_AF 

B_CN_RL42_ 

B_DE_D3 1_U 

B_DE_HAN_U 

B_FR_HXB2_ 

B_GA_OYI 

B_GB_CAM1_ 

B_GB_GB8_A 

B_GB_MANC_ 

B_KRJWK_AF 

B_NL_3202A 

B_JTW_TWCYS 

B_US_BC__L0 

B_USJDH123 

B_US_JRCSF 

B_US_MNCG_ 

B_US_P896__ 

B__US_RF_M1 

B_US_SF2_K 

B_US_WEAU1 

BJtJS_WR27_ 

B_US_YU2__M 

BF1JBR_93B 

C_BR_92BR0 

C_BW_96BW0 

C BW 96BW1 



GE.GAWIQD NSD IKWPRR KAKIIKDYGK 
GE.GAWIQD NSD IKWPRR KAKIIRDYGK 
GE.GAWIQD NSD IKWPRR KAKIIKDYGK 
GE.GAWIQD NSD IKWPRR KAKIIKDYGK 
GE.GAWIQD NSD IKWPRR KVKIIKDYGK 
GE.GAWIQD KSDIKWPRR KAKIIKDYGK 
GE.GAWIQD NSD IKWPRR KAKIIKDYGK 
GE.GAWIQD NGDIKWPRR KAKIIRDYGK 
GE.GAWIQD NSD IKWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR RAKIIRDYGK 
GE.GAWIQD NNEIKWPRR KTKILRDYGK 
GE.GAWIQD NXDIKWPRR KAKIIXDXXK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NNDIKWPRR KAKILRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAVLIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD QSDIKWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIKDYGK 
GE.GAWIQD NSD IKWPRR KAKIIRDYGK 
GE.GAWIQD NSDI KWPRR KAKIIKDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NS EI KWPRR KVKIIRDYGK 
GE.GAWIQD NS D I KWPRR KAKIIRDYGK 
GE.GAWIQD NGDIKWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAVAIQD NNEIKWPRR KAKIIRDYGK 
GE.GAWIQD NS EI KWPRR KVKIIKDYGK 
GE.GAWIQD NS E I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD IKWPRR KAKIIRDYGK 
GE.GAWIQD NS E I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIRDYGK 
GE.GAWIQD NS EI KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSE IKWPRR KVKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD KSDIKWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIRDYGK 
GE.GAWIQD NNDIKWPRR KAKVIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KAKIIRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIRDYGK 
GE.GAWLQD NSD I KWPRR KVKIIKDYGK 
GE.GAWIQD NSD IKWPRR KVKIIRDYGK 
GE.GAWIQD NSD I KWPRR KVKIIRDYGK 



QMAGADCVAG 
QMAGADCVAG 
QMAGAD CVAS 
QMAGADCVAD 
QMAGADCVAG 
QMAGDDCVAG 
QMAGADCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGXDCVAS 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDGCVAG 
QMAGDDCVAS 
QMAGDDCVAG 
QMAGDDCMAG 
QMAGADCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDGCVAG 
QMAGDDCVAS 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGGDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAG 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAG 
QMGSDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAG 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAG 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QTAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAS 
QMAGDDCVAG 
QMAGGDCVAG 
QMAGAD CMAS 
QMAGADCVAG 
QMAGADCVAG 



GQDED. 
RQDEDQ 
RQDED. 
RQDED . 
RQDED . 
RQDED. 
RQDED. 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED. 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDEDW 
RQDED . 
RQDED. 
RQDED . 
RQDED . 
GQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED . 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED . 
RQDED . 
RQDED . 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
RQDED. 
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C_BW_96BW1 

C_BW_96BW1 

C_ET_ETH22 

C_IN_93IN1 

C_IN_93IN9 

C_IN_93IN9 

C_IN_94IN1 

C_IN_95IN2 

CRF01_AE_C 

CRF01_AE_C 

CRF01_AE_C 

CRF01_AE_T 

CRF01_AE_T 

CRF01_AE_T 

CRFO 1_AE_T 

CRF01_AE_T 

CRF01_AE_T 

CRF02_AG_F 

CRF02_AG_F 

CRF02_AG_G 

CRF02_AG_N 

CRF02_AG_S 

CRF02_AG_S 

CRF03_AB_R 

CRF03_AB_R 

CRF04_cpx_ 

CRF04_cpx_ 

CRF04_cpx_ 

CRF05_DF_B 

CRF05_DF_B 

CRF06_cpx_ 

CRF06_cpx_ 

CRF06_cpx_ 

CRF06_cpx_ 

CRFll_cpx_ 

CRFll_cpx_ 

D_CD_84ZR0 

D_CD_ELI_K 

D_CD_NDK_M 

D_UG_94UG1 

F1_BE_VI85 

F1_BR_93BR 

F1_FI_FIN9 

F1_FR_MP41 

F2_CM_MP25 

F2KU_BE_VI 

G_BE_DRCBIi 

G_NG_92NG0 

G_SE_SE616 

H_BE_VI991 

H_BE_VI997 

H_CF_90CF0 

J_SE__SE702 

J_SE_SE788 

K_CD_EQTB1 

K_CM_MP535 

N__CM_YBF3 0 

O_CM_ANT70 

0__CM_MVP51 

Q_SN_99SE_ 

O SN 99SE 



GE.GAWIQD NSDIKWPRR KVKIIKDYGK QMAGADCVAG RQDED 
GEGAVWXQD NSDIKWPRR KVKI IRDYGK QMAGADCVAG RQDED 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGADCVAG RQDED 
GE.GAWIQD NSDIKWPRR KAKI IKDYGK QMAGADCVAG RQDED " 
GE.GAWLQD NSDIKWPRR KAKI IKDYGK QMAGADCVAG RQDED ' 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGADCVAG RQDED 
GE.GAWIQD NSDIKWPRR KAKI IKDYGK QMAGADCVAG RQDED 
GE.GAWIQD NSDIKWPRR KAKI I KDYGK QMAGADCVAG RQDED ' 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDEN* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQNED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED " 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED " 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAS RQDED ' 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED ' 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED ' 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED ' 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED ' 
GE.GAWIQD KSDIKWPRR KAKI I KDYGK QMAGDDCVAG RQDED ' 
GE.GAWIQD KSDIKWPRR KAKI IKDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KAKILRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KVKIVRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KTKILRDYGK QMAGDDCVAG GQNED * 
GE.GAWIQD NNDIKWPRR KAKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD NNDIKWPRR KAKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGNDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IKDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAIVIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAG SQDED * 
GE.GAWIQD NSDIKWPRR KAKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD KSDIKWPRR KVKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD NSDIKWPRR KVKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD NSEIKIVPRR KAKI IRDYGK QMAVDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NNEIKVIPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NNEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NNEIKWPRR KAKI IiKD YGK QMAGGDCVAG RQDED* 
GE.GAWIQD NNEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED * 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NSEIKWPRR EAKI IRDYGK QMAGDDCVAS RQDED* 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED ' 
GE.GAWIN. .SEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED 
GE.GAWIQD NSEIKWPRR KAKI IRDYGK QMAGDDCVAG RQDED* 
GE.GAWIQD NGDIKWPRR KAKI IRDYGK QMAGDGCVAS GQDENQ 
GE.GAWIQD KGDIKWPRR KAKI IREYGK QMAGTDSMAS GQTESE 
GE.GAWIQD KGDIKWPRR KAKI IRDYGK QMAGTD S MAN RQTESE 
GE.GAWIQD KGDIKWPRR KAKI IRHYGK QMAGTDSMAS GQTESE 
GE.GAWIQD KGDIKWPRR KAKI IRHYGK QMAGTDSMAS GQTESE 
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Table 15. HIV Rev Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga 1.1 



JNcLLllC - 


u UCWU / O ^5 X 


Len : 


129 


JNcLLlIC • 


uuowu /oo z 


Len : 


129 


N3.1T1G I 


UuDWUO ^ 


Len : 


129 


"M"a mo * 


r> nTAixn a n i o 

UUflWlfi / 1 ^ 


Len : 


129 


iMaiue . 


UUcnloXb 2 


Len : 


129 


Nsms i 


UUDWlbob 0 


Len : 


129 


Name : 


A A T»7 n T C Q O 

UUJoWX759 3 


Len: 


129 


jm dine : 


UUJaWX / 73 2 


Len : 


129 


Wdiuc : 


A ATaTiJ-i o a o c • 


Len : 


129 


Name : 


U UJdWX / 95 6 


Len: 


12 9 


Name : 


AA"QTaT"1 0 1*1 0 

UUiaWXcXX 3 


Len : 


129 


Name : 


ftftT3M1 OCA f - 

UUiiWlo59 5 


Len: 


129 


Name : 


a a dmi 0 0 r\ 
UUBWlboO 2 


Len : 


129 


Name : 


UUxJWX92X X 


Len: 


129 


Name : 


UUxiWzU Jo X 


Len : 


129 


JMame : 


UUJrJW^Uoo 6 


Len : 


129 


Name : 


00BW2087 2 


Len: 


129 


Name : 


a a dtaTO tot 
UUI5W2X27 2 


Len : 


129 


JMame : 


UUi3VV2276 7 


Len: 


129 


"KT m ^ • 
IN altlc ! 


AAT3TATO.Q1 O *5 

U UdWj o19 3 


Len: 


129 


Name * 




Len : 


129 


Name : 


U U JdW3 0 7 X-_3 


Len: 


129 




U UdvV J O/b y 


Len: 


129 


JMaulc I 


A ATOTaTO O O f O 


Len : 


129 


Name : 


AATSTaT^QOI C 


Len : 


129 


JMalllc * 


A A*DTa1"3 OTA O 


Len: 


129 


"KT ^ TVN ^% _ 

JMame : 


UUJ3WOU3X X 


Len: 


129 


Name : 


ybbWUIB2 X 


Len: 


129 


Name : 


y OJ3W04 07 


Len: 


129 


JMame : 


Q /TT"} 7*7 r\ T~ f\ #■* 


Len: 


129 


Name : 


ybBWU6 U4 


Len: 


129 


Name : 


9oBWll 06 


Len: 


129 


"NT m r~* ■ 

jMame : 


QCDH1 O "I A 

yoBWX2X0 


Len : 


129 


JMame : 




Len: 


12 9 


JMame i 


y DflWlo 2 6 


Len: 


129 


Name : 


yoJ3WX7A09 


Len : 


129 


JMame : 


yboWlYlOX 5 


Len : 


12 9 


jm curie . 


l7bi3WiyivJ3 2 


Len : 


129 




QPRMMPI O O 


T aw 

jjen : 


129 




27 OJDWJYJ^. J. J 4 


Len : 


129 




27 aoWV r l\^J.*± a 


Len : 


129 


Name : 


Q O "RTXTMO 1 A 1 


Len : 


129 


Name : 


QflRWMm ft r\ 


Lien : 


129 


Name : 


JODrvriUj D cl 


Len : 


129 


Name : 


JODIfVl'lUj / U 


Len : 


ion 

129 


Name : 


99BW3932~1 


Len : 




Name : 


99BW4642~4 


Len: 


129 


Name : 


99BW4745 8 


Len: 


129 


Name : 


99BW4754__7 


Len: 


129 


Name : 


99BWMC16_8 


Len: 


129 


Name : 


A2_CD_97CD 


Len: 


129 


Name : 


A2_CY_94CY 


Len: 


129 


Name : 


A2D 97KR 


Len: 


129 


Name : 


A2G_CD_97C 


Len: 


129 


Name: 


A_BY 97BL0 


Len: 


129 


Name : 


A_KE_Q23_A 


Len: 


129 


Name : 


A_SE_SE659 


Len: 


129 
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cnecK : 


4903 


Weight 


: 1 


.00 


CnecJc : 


5102 


Weight 


: 1 


.00 


cnecK : 


5815 


Weight 


: 1 


.00 


Check : 


4144 


Weight 


: 1 


.00 


Check : 


5298 


Weight 


: 1 


.00 


Check : 


3871 


Weight 


: 1 


.00 


Check : 


4976 


Weight 


: 1 


.00 


Check: 


5775 


Weight 


: 1 


.00 


Check : 


6142 


Weight 


: 1 


.00 


Check : 


5055 


Weight 


: 1 


.00 


Check : 


5804 


Weight 


: 1 


.00 


Check : 


5252 


Weight 


: 1 


.00 


Check : 


4995 


Weight 


: 1 


.00 


Check : 


6482 


Weight 


: 1 


.00 


Check : 


4770 


Weight 


: 1 


.00 


Check : 


5384 


Weight 


: 1 


.00 


Check : 


4848 


Weight 


: 1 


.00 


Check : 


5783 


Weight 


: 1 


.00 


cnecJc : 


5364 


Weight 


: 1 


.00 


Check : 


5712 


Weight 


: 1 


.00 


Check: 


5586 


Weight 


: 1 


.00 


Check : 


5299 


.Weight 


: . 1 


.00. 


Check : 


4423 


Weight 


: 1 


.00 


Cneck : 


5415 


Weight 


: 1 


.00 


Check : 


5426 


Weight 


: 1 


.00 


Check : 


2613 


Weight 


: 1 


.00 


Check : 


4597 


Weight 


: 1 


.00 


Check : 


5653 


Weight 


: 1 


.00 


Check : 


4310 


Weight 


: 1 


.00 


Check : 


4675 


Weight 


1 


.00 


Check : 


5079 


Weight 


1 


00 


Check: 


5939 


Weight 


1 


.00 


Check : 


5666 


Weight 


1 


00 


Check: 


5102 


Weight 


1 


00 


Check : 


5675 


Weight : 
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ARKNRRRRWR 
ARKNRRRRWR 
ARKNRRRRWR 
ARRNRRRRWR 
ARKNRRRRWR 
ARKNRRRRWR 
ARRNRRRRWR 
ARRNRRRRWR 
ARKNRRRRWR 
ARRNRRRRWR 
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ADK_CD_MAL 

AG_BE_VI11 

AG_NG_92NG 

AGHU_GA_VI 

AGU_CD_Z32 

AJ_BW_BW21 

B_AU_VH_AF 

B_CN_RL42_ 

B_DE_D3 1_U 

B_DE_HAN_U 

B_FR_HXB2_ 

B_GA_OYI_M 

B_GB_CAM1_ 

B_GB_GB8_A 

B_GB_MANC_ 

B_KR_WK_AF 

B_NL_3202A 

B_TW_TWCYS 

B_US_BC_L0 

B_US_DH123 

B_US_JRCSF 

B_US_MNCG_ 

B_US_P896_ 

B_US_RF_M1 

B_US_SF2_K 

B_US_WEAU1 

B_US_WR2 7_ 

B_US_YU2_M 

BF1_BR_93B 

C_BR_92BR0 

C_BW_96BW0 

CJ3W_96BW1 

C_BW_96BW1 

C_BW_96BW1 

C_ETJETH22 

C_IN_93IN1 

C_IN_93IN9 

C_IN_93IN9 

C_IN_94IN1 

C_IN_95IN2 

CRFO 1_AE_C 

CRF01_AE_C 

CRF01_AE_C 

CRF01_AE_T 

CRF01_AE_T 

CRF01__AE_T 

CRF01_AE_T 

CRF01_AE_T 

CRF01_AEJT 

CRF02_AG_F 

CRF02_AG_F 

CRF02__AG_G 

CRF02_AG__N 

CRF02_AG__S 

CRF02_AG_S 

CRF03_AB_R 

CRF03_AB__R 

CRF04_cpx_ 

CRF04_cpx_ 

CRF04_cpx_ 

CRFO 5 DF B 



MAGRSGD. 
MAGRTGS . 
MAGRSGD. 
MAGRSGA. 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSED . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD. 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGE . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRRGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSGD . 
MAGRSEDS 
MAGRSGD . 
MAGRSGD S 
MAGRSGD S , 
MAGRSGD S , 
MAGRSGDS , 
MAGRSGD S . 
MAGRSGN. . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGS . . 
MAGRSGD . . 
MAGRSXD . . 
MAGRSGD. . 
MAGRSGD . . 
MAGRSGN. . 
MAGRSGD. . 
MAGRSGD . . 
MAGRSGD. . 
MAGRSGN. . 
MAGRSG. . . 
MAGRSGS . . 
MAGRSGD . . 



. . SDEDLLRAIR LIKILYQSNP PPNTEGTTRQ ARRNRRRRWR 
. . TDEELLKAVR TIKILYQSNP YPSSEG.SRQ ARKNRRRRWR 
. . ADEELLRVTR IIKILYQSNP YPPPEG.TRQ ARKNRRRRWR 
. . SDEELLKAVR IIKILYQSNP FPE . PTGTRQ ARRNRRRRWR 
. . SDEELLKIVR IIKILYQGNP YPPPEG.TRQ ARRNRRRRWR 
. . NDEQLLLAIR IIKILYKSNP YPKPNG.SRQ ARRNRRRRWR 
. . SDDELLKTVR LIKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. - SDEELLKTVR LIKLLYQSNP LPS PEG . TRQ ARRNRRRRWR 
. . SDEELLKTVR LIKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. . SDEELLKTVR LIKFLYQSNP PPSNEG.TPT ARRNRRRRWR 
. . SDEELIRTVR LIKLLYQSNP PPNPEG . TRQ ARRNRRRRWR 
. . SDEELLKTVR LIKFLYQSNP PPNPEG . TRQ ARRNRRRRWR 
. . SDEELLKAVR LIKLLYQSNP LPS S KG .TRQ ARRNRRRRWR 
. . SDEDFLKTVR FIKFLYQSNP PPNPKG . TRQ ARRNRRRRWR 
. . SDEELLQTVK LVKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. . SDEELLRTIR IIKFLYQSNP LPEPEG . TRQ ARRNRRRRWR 
. . SDEDLLKTVR LIKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. . SDEELLRTVR LIKLIYQSNP PPNPEG . TRQ ARRNRRRRWR 
. . SDEELLKTVR LIKLLYQSNP PPNPEG. TRQ ARRNRRRRWR 
. . SDEDLLNTVR LIKLLYQSNP LPSLEG . TRQ ARRNRRRRWR 
, . SDEDLLKTVR LIKFLYQSNP PPSNEG . TRQ ARRNRRRRWR 
. . SDEELLKTVR LIKFLYQSNP PPSSEG.TRQ ARRNRRRRWR 
>. SDEDLLKTVR LIKFLYQSNP PPSLEG . TRQ ARRNRRRRWR 
. . SDEDLLKAVR LIKSLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. . SDEELLRTVR LIKLLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. . SDEDLLKTVR LIKILYQSNP PPSPEG.TRQ ARRNRRRRWR 
. . SDEELLQKV. LIRFLYQSNP PPSSEG . TRQ ARRNRRRRWE 
. - SDEDLLRTVR LIKVLYQSNP PPSSEG.TRQ ARRNRRRRWR 
. . SDTELLKAVS YIKILYQSNP LPKPKG . TRQ ARRNRRRRWR 
, . SDEALLQAVR IIKILYQSNP YPK . PEGTRQ ARRNRRRRWR 
SDEALLQAVR IIKILYQSNP YPK. PEGTRQ AWRNRRRRWR 
. NDEALLQAVR IIKILYQSNP YPK. PEGTRQ ARKNRRRRWR 
. SDEALLQAVR IIKILYQNNP YPK. PEGTRQ ARKNRRRRWR 
. . DEALLHAVR IIKILYQSNP YPE.PKGTRQ ARKNRRRRWR 
. SDEELLKAVR IIKILYQSNP YPT. PEGTRQ ARRNRRRRWR 
. . DEALLQAVR IIKILYQSNP YPE.PKGTRQ ARKNRRRRWR 
. . DEELLKAVR IIKILYQSNP YPE . PRGTRQ ARKNRRRRWR 
. . DEALLRAVR IIKILYQSNP YPE. PRGTRQ ARKNRRRRWR 

• . DEALLKAVR IIKILYQSNP YPE . PRGTRQ ARKNRRRRWR 
. .DEALLKAVR IIKILYQSNP YPE. PRGTRQ ARKNRRRRWR 
. TDEDLLQAVR IIKILYQSNP YPPPEG.TRQ ARKNRRRRWR 
. TDEDLLRTVR IIKILYQSNP YPPAEG . TRQ ARKNRRRKWR 
. TDEELLRAAR AIKILFQSNP YPSSEG . TRQ ARKNRRRRWR 
. TDEELLRAVR IINILYQSNP YPSSEG. TRQ TRKNRRRRWR 
. TDEELLRAVR IIKVLYQSNP YPSSEG. TRQ TRKNRRRRWR 
. TDEELLRAVR IINILYQSNP YPSSEGGTRQ TRKNRRRRWR 
. TDEELLRAVR IIKLLYESNP PPSSEG.TRQ TRKNRRRRWR 
. TDEELLRAVR IIKILYQSNP FPSSEG . TRQ TRKNRRRRWR 
. TDEELLRAVK IINILYQSNP LPSSEG.SRQ TRKNRRRRWR 
. ADEELLRWR IIKILYQSNP YPPPEG.TRQ TRKNRRRRWR 
. ADEELLRWR IVKILYQSNP YPPPEG.TRQ ARKNRRGRWR 
' ADEELLRVIR IIKILYQSNP YPKPEG . TRQ ARRNRRRRWR 

• ADEELLRAVR IIKILYQSNP YPPPEG.TRQ TRKNRRRRWR 
. ADEELLRAVR TIKILYQSNP YPPPEG.TRQ ARKNRRRRWR 
. ADEGLLRAVR IIRILYQSNP YPPPEG.SRQ ARRNRRRRWR 
. SDEELLKTIR LIKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
. SDEDLLKTIR LIKFLYQSNP PPNPEG. TRQ ARRNRRRRWR 
. IDEDLFKAAR AIKILYQSNP YPNNPTGTRQ ARRNRRRRWR 
. SNEDLLGSVG IVKILYQSNP YPN . PTGTRK ARRNRRRRWR 
. TDEDLLKAVG IVKILYQSNP YPNNTAGTRQ ARRNRRRRWR 
. RDEDLLKAVR LIKILYQSNP LPS PEG . TRQ ARRNRRRRWR 
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CRF05_DF__B MAGRSGD . . . RDEDLLKAVR 
CRF06_cpx_ MAGRSGD. . . SDDRLLLAVR 
CRF06_cpx_ MAGRSGD... NDEQLLLAVR 
CRF06_cpx_ MAGRSGD... NDEQLLLAVR 
CRF06_cpx_ MAGRSGD... SDEQLLWAVR 
CRFll_cpx_ MAGRSGD... NDEQLLTAVK 
CRFll_cpx_ MAGRSGD... SDAQLLAAAR 

D_CD_84ZR0 MAGRSGD... SDEDLLTAVR 
D__CDJSLI_K MAGRSGD . . . SDEDLLRAVR 

D_CDJNDK_M MAGRSGD . . . SDENLLKAIR 

D_UG_94UG1 MAGRSGD . . . RDEELLQAVR 

F1_BE_VI85 MAGRSGD... SDTELLRAVR 

F1_BR_93BR MAGRSGD... SDQELLRAVR 

F1_FI_FIN9 MAGRSGD... SDTELLKAVK 

F1_FR_MP41 MAGRSGD... NDEELLRAVR 

F2_CM_MP25 MAGRSGD... RDEELLKAVR 

F2KU_BE_VI MAGRSGD... SDEELLKAVR 

G_BE_DRCBL MAGRSGS . . . TDEELLTAVR 

G_NG_92NG0 MAGRSGD . . . PDEELLRAVR 

G_SE_SE616 MAGRSGS... TDEELLRAVR 

H_BE_VI991 MAGRSGD. . . NDEGL LRACR 

H_BE_VI997 MAGRSGA. . . GDEQLPQVCR 

H_CF_90CF0 MAGRSGA. . . SDTELLQVCR 

J_SE__SE702 MAGRSGD... NDDQLLLAVR 

J_SE_SE788 MAGRSGD... SDDQLLLAVR 

K_CD_EQTB1 MAGRRGD . . . SEQQLLTPVR 

K_CM_MP535 MAGRRGD. . . PDEQLLTTVR 

N_CM_YBF3 0 MAGRSGVN. . . DEELLRAVR 

0_CM_ANT7 0 MAGRSED ... . DQ . LLQAIQ 

0_CM_MVP5 1 MAGRSEE ... . DQQLLQAIQ 

0_SN_MP12 9 MAGRSDG ... . DQPLLRAIQ 

OJBN_MP130 MAGRSDG... .DQPLLRAIQ 

U _CD 83C MAGRSGE... SDEELLRAVR 

51 

00BW0762_1 ARQRQIHSIS ERILSTVLGR 

00BW0768_2 ARQRQINSIS ERILSTCLGR 

00BW0874_2 ARQRQINSIS GRILSACLGR 

00BW1471_2 ARQRQIRAIS ERILDTCLGR 

00BW1616_2 ARQRQIHSIS ERILSACLGR 

00BW1686_8 ARQRQIHSIS ERILSTCLGR 

00BW1759_3 ARQRQIDSIS ERVLSTVLGR 

00BW1773_2 ARQRQIREIS QRILSTYLGR 

O0BW1783__5 ARQRQIHSIS ERILSTCLGR 

00BW1795_6 ARQRQIHSIS ERILSTCLGR 

00BW1811_3 ARQRQIREIS ERILCTCLGR 

00BW1859_5 ARQRQIHTIS ERILSTCLGR 

00BW1880_2 ARQRQIHSIS ERILSTCLGR 

00BW1921_1 ARQRQINSIS ERILTTCLGR 

00BW2036_1 ARQRQIDSIS ARILSTCLGR 

00BW2063_6 ARQRQIHSIS ERILSTCLGR 

00BW2087_2 ARQREIHSIS ERILTTVLGR 

00BW2127_2 ARQRQIHSIS ERILSTCLGR 

00BW2276_7 ARQRQIHTIS ERILSTCLGR 

00BW3 819_3 ARQRQIHSIS ERILSACLGR 

00BW3842_8 ARQRQIDSIS GRLLSTCLGR 

00BW3871J3 ARQRQINSIS ERILSTCLGR 

00BW3876_9 ARQKQIHSIS ERILSACLGR 

00BW3 886_8 ARQRQINSIT ERILSDCLGR 

00BW3 891_6 ARQRQIHSLS ERILSTCLGR 

00BW3970_2 ARQKEIHSLS ERILSTCLGR 



LIKFLYQSNP PPRPEG..TRQ ARRNRRRRWR 
IIKILYQSNP YPKPNG.SRQ ARRNRRRRWR 
IIKILYQNNP YPKPNG . GRQ ARRNRRRRWR 
TIKILYQSNP YPKPSG.SRQ ARRNRRRRWR 
VIKILYQSNP YPKLSG.SRQ ARRNRRRRWR 
IIKILYQSNP QPNPTG.SRQ ARRNRRRRWR 
IIKILYQSSP YPKPAG.TRQ ARRNQRRRWR 
LIKILYQSNP PPSPEG.TRQ ARRNRRRRWR 
LIKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
LIKFLYQSNP PPSPEG.TRQ ARRNRRRRWR 
LIKILYQSNP PPSPEG.TRQ ARRNRRRRWR 
CIKILYQSNP YPKPEG.TRQ ARRNRRRRWR 
YIKILYQSNP YPKPEG.TRQ ARRNRRRRWR 
YIKILYQSNP YPSPDG.TRQ ARRNRRRRWR 
AIKILYQSNP YPKPEG.TRQ ARRNRRRRWR 
YIKILYQSNP YPKLEG.TRK ARRNRRRRWR 
LIKILYQSNP YPKPEG.TRQ ARRNRRRRWR 
IIKLLYQSNP SPPPEG.TRQ ARRNRRRRWR 
IIKTLYQSNP YPSPAG.TRQ ARRNRRRRWR 
AIKILYQSNP YPPPEG.TRQ ARRNRRRRWR 
IIRLLYQSNP YPE.PAGTRQ AQRNRRRRWR 
IIKIIYQSNP YPE.PAGTRQ ARRNRRRRWR 
IIKILYQSNP CPE . PTGTRQ ARRNRRRRWR 
IIKILYQSNP YSKPNG.SRQ ARRNRRRRWR 
LIKILYQSNP YPKPNG.SRQ ARRNRRRRWR 
IIKILYQSNP YPKPEG.TRQ ARRNRRRRWR 
TIKI LYE SNP YPNLEG.SRQ TRRNRRRRWR 
VIKILYQSNP YPNSKG.TRQ ARRNRRRRWR 
IIKILYQSNP QPSPRG.SRN ARRNRRRRWR 
IIKILYQSNP CPTPAG.SRN ARRNRRRRWR 
IIKILYQSNS HTSPTG.SRS ARRNRRRRWR 
IIRILYQSNP HPSPTTGSTS ARRNRRRRWR 
IIRILYQSNP PPNPEG.TRQ ARRNRRRRWR 

100 

PAEPVPFQLP PIERLHIGCS ESGGTSGTQQ 
PAEAVPLQLP PIERLHIGCN ESGGTSGTQQ 
PTEPVPFQLP PIERLHINCS ENGGTSGTQQ 
LTEPVHLPLP PLERLHLDCS EDCEPTGTEQ 
SAEPVPFQLP PIERLHIDCS ESSGNSG. 
SAEPVPLQLP PIERLHIDCS ESGGTFGTQQ 
PTEPVPFQLP PIERLDIGDS ESGGTSGTER 
PAEPVPLLLP PIERLHIDCS ESGGTSGTQQ 
STEPVPFQLP PIERLHIGDS KSSGTSGTQQ 
PAEPVPFLLP PLERLHIGDS ESSGTSGTQQ 
STEPVPFLLP PIERLHIGDS EGSGTSGTQQ 
PAEPVPLQLP PIERLHIDCS ESSGTSGTQQ 
STEPVPFQLP PIERLHIGDS ESSGTSGTQR 
SEEPVPLQLP PIERLNIGGS ESSGTSGTQQ 
PAEPVPFQLP PIERLNIGDN ESGGTSGTQQ 
SEEPVPLLLP PIERLRIGDS ESSGTSGTQQ 
SAEPVPFQLP PIERLNINCS EGSGTSGTQQ 
STEPVPFQLP PIERLNIGDS * EGSGTSGTQR 
PAEPVPLQLP PIERLYIGDS EGGGTSGTQQ 
PAEPVPFQLP PIERLNIGDS EGDRTYGTQQ 
SAEPVPLQLP PIERLNIGDS ESGGTSGTQQ 
SAEPVPLQLP PIERLHIDCS ESGGTSGTQQ 
SAEPVPLQLP PIERLHIGGS ESGGTSGTQQ 
SAEPVPLQLP PIERLHIVDS ESGGTSGTQQ 
SAEPVPLQLP PLERLHIGDS ESGGTSGTQQ 
PAEPVPLQLP PLERLHIDCS ESGGTSGTQ. 
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00BW5031_1 ARQRQIHSIG ERILSTCLGR 

96BW01B21 ARQRQIHSIS ERILTTCLGR 

96BW0407 ARQRQIHSIS ERILSTCLGR 

96BW0502 ARQRQIHSLS ERILSACLGR 

96BW06_J4 ARQRQINSIS ERILSTCLGR 

96BW11_06 TRQKQINSIS ERILSTCLGR 

96BW1210 ARQRQIHSIS ERILSTCLGR 

96BW15B03 ARQRQIDSIS TRILSTCLGR 

96BW16_26 ARQRQIRAIS ARIFSACLGR 

96BW17A09 ARQRQIHSIS ERILSTCLGR 

96BWM01_5 ARQRQINSIS ERILSACLGR 

96BWM03_2 ARQRQIHSIS ERILSACLGR 

98BWMC12_2 ARQKHIHSIS ERILSDCLGR 

98BWMC13_4 ARQRQINSIS ERILSTCLGR 

98BWMC14_a ARQTQINSLS ERIVSTCLGR 

98BWM014_1 ARQRQIRALS ERILSTVLGR 

98BWMO!8_d ARQRQINSIS ERILSTCLGR 

98BWM036_a ARQRQIHSIS ERILSSCLGR 

98BWM037_d ARQRQIHSIS ERILSTCLGR 

99BW3932_1 ARQRQIHSIS ERILFTCLGR 

99BW4642_4 ARQRQISSLS ERILSTCLGR 

99BW4745_8 ARQNQIRAIS KRILSTCLGR 

99BW4754_7 ARQRQIHSIS ERILSNCLGR 

99BWMC16_8 ARQRQIDSIS QRILSDCLGG 

A2_CD_97CD ARQRQIDSIS ERILSTCLGR 

A2_CY_94CY ARQRQIDSIS ERVLRTCLGR 

A2D 97KR ARQRQIRAIS ERILSACLGR 

A2G_CD__97C ARQRQIREIS ERILSTCLGR 

A_BY_97BL0 ARQRQINSIS KRIFSTCXGR 

A_KE_Q23_A ARQRQIDSIS ERILSTCLGR 

A_SE__SE659 ARQRQIDSIS ERILSACLGR 

A_SE_SE725 ARQRQIDSIS ERILSTCLGR 

A_SE_SE753 ARQRQIDSIS ERILNACLGG 

A_SE_SE853 ARQRQIDSIS ERILSTCLGR 

A_SE_SE889 ARQKQIDSLS ERILSTCLGR 

A_JSE_UGSE8 ARQRQIDSLS QRILSACLGR 

A_UG_92UG0 ARQRQIDTLS ERVLSTCLGR 

A_UG_JJ455_ ARQRQIDSLS ERILSDCLGR 

AC_IN_2130 ' ARQRQIDSIS ERILSTFLGR 

AC_RW_92RW ARQRQIHSIS ERILSTCLGR 

AC_SE_jSE94 ARQRQIDSIS ERILSTCLGR 

ACD_SE_SE8 ARQRQIDSIS QRILSTCLGR 

ACG__BE__VI1 ARQRHIHSLS ERILCTCLGR 

AD_SE_SE69 ARQRQINSIG ERILSTYLGR 

AD_SE_SE71 ARQNQIDSIS KRILSNCLGR 

ADHK_NO_97 ARQXQIHSIG ERVLATCMGR 

ADK_CD_MAL ARQRQINSIG ERILSTYLGR 

AGJBE_VI11 ARQRHIQAIS RRILDACLGR 

AG__NG__92NG ARQRQISALS ERILSTCLGR 

AGHU_GA_VI ARQKQIHSIG ERVLATYLGR 

AGU_CD_Z32 ARQRQIHSLG ERILTTCLGR 

AJ_BW__BW21 ARQNQIDSIS ERILSTCLGR 

B_AU_VH_AF ARQRQIRQIS GWILSTYLGR 

B_CN_RL42_ ARQRQIREIS DRILVTYLGG 

B_DEJD31_U QRQRQIQSIS ERILSTYLGR 

B_DE_HAN_U ERQRQIRSIS ERILSTFLGR 

B_FR_HXB2_ ERQRQIHSIS ERI LGTYLGR 

B_GA_OYI_M ERQRQIRKIS GWILSTYLGR 

B_GB_CAM1_ ERQRHIRAIS NWILSTHLGR 

B_GB_GB8_A ARQRQIHQIG EWILSAFLGR 

B_GBJV[ANC_ GRQRQIQSLS AWILSTRLGR 



SAEPVPLQLP PLERLHIDCS ESSGTPGTQQ 
FTEPVPFQLP PIERLHIGDS ESGGTSGTQQ 
PTEPVPFQLP PIERLHIDCS ESSGASGTQR 
PAEPVPFQLP PIERLHIDCS KSVGTSG. . . 
SAEPVPFQLP PIERLHIGGS ESGGTSGTQQ 
SAEPVPFLLP PIERLHISDS ESSGTSGTQQ 
PAEPVPLQLP PIERLHIGGS ENSGTTGTQQ 
PEEPVPFQLP PIERLNIGDS ESGGTSGTQQ 
PAEPVPLQLP PIERLHIGSS ESGGTSGTQ . 
PTEPVHLPLP PLERLHLDCS EDCEPTGTEQ 
PAEPVPFLLP PIERLHIGDS ESSGTSGTQQ 
PTEPVPFQLP PIERLSIGDS ESSGTSGTQQ 
SAEPVPLQLP PIERLHIDCS ESGGTSGTQQ 
STEPVPFLLP PIERLHIGDS ESSGTSGTQQ 
PAERAPFQLP PIEKLHIDCS ESGGTPG. . . 
PTEPVPFQLP PIERLNINCS ESSGTSGTQQ 
PAEPVPFQLP PIERLHIDCS ESSGTSGTQH 
PTEPVPFQLP PIERLHIGDS ESGGTYGTQQ 
SAEPVPLQLP PIERLHIADS KGGGTSGIQQ 
STEPVPFQLP PIERLNIGDS ESGGTYGTLQ 
SAEPVPLQLP PIERLHIDCS ESSGTSGTQQ 
PAEPVPLQLP PLERLHIGDS ESGGTSGTQQ 
PAEPVPFQLP PIEGLHIDCN ESSGTSGTQQ 
PAEPVSFQLP PIERLNIDCN ESGGTSGTQQ 
PTEPVPLQLP PLERLHLDCS EDCGTCGTQQ 
PTEPVPLQLP PLERLHIDCS EDCGTSGTLQ 
PTEPVPLQLP " PLERLHLDCS EDCGTSGTQQ 
PTEPVPLLLP PLERLHLDCS EDGGTAETQQ 
SXEPVSLQLP PIERLHLDCS EDCXTSETQQ 
PTEPVPLQLP PLERLHLDCC EDCGTSGTQQ 
STEPVPLQLP PLERLNLDCR EDCGTSGTQQ 
SEEPVPLQLP SLETLHLDCH DDCGTSGTQQ 
STEPVPLQLP PLERLNLDCR EDCGTSGTQQ 
SAEPVPLQLP PLEGLHLDCC EDCGTSGTEG 
PQEPVPLLLP PIERLHLDCS EDCGTSGTQQ 
SEEPVPLQLP PLERLHIDCH EDCGTSGTG. 
PAEPVPLQLP PIERLHLDCS EDCGTSGTQQ 
PAEPVPLQLP PIERLRLDCS ESCGTSGTQQ 
SAEPVPLQLP PLERLHLDCQ EDCGTSGTQQ 
PTEPVPFQLP PIERLTIDCS EDGGTSGTQQ 

SAEPVPLQLP PLERLHLD SGTQQ 

SEEPVPLQLP PLERLNLDCC EDCGTSGTQG 
SEEPVHLPLP PLEGLTLDCN ESSGTSGTEG 
SQEPVPLQLP PLERLTLNCI EDCGTSGTQG 
PAEPVPLQLP PLERLNLNCS KDCGTSGTQG 
PAEPVPLQLP PLERLTLDSS EDCDIAGKQG 
PEEPVPLQLP PLERLTLNCN EDCGTSGTQG 
PAEPVPLQLP PLERLSLDCS KDIGTSGTQR 
PAEPVPLQLP PIERLSLDCS EDSRTPETQQ 
PAEPVPLQLP PLERLTLDCS EDCGTSGEKG 
STEPVPFLLP PIERLRIDCS EDRGDSDPQG 
PTEPVPFQLP PIERLRLDCS EDCGHSGTQG 
PAEPVPLQLP PLERLTLDCS KDCGTSGTQG 
STEPVPLQLP PLERLTLDCS KDCGTSGTQG 
PEQPVPLPLP PLERLTLDCS EDCGTSGTQG 
PAEPVPLQLP PLERLTLDCS EDCGNSGTQG 
SAEPVPLQLP PLERLTLDCN EDCGTSGTQG 
SAEPVPLQLP PLERLNLDCS EDCGTSGTQG 
PAEPVPLQLP PLERLTLDCS KDCGTSGTQG 
PAEPVPLQLP PIERLTLDCD EDCGTSGTQG 
STQPVPLQLP PLERLTLDCS EDCGTSGTQG 
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B_KR_WK_AF RRQWWIQSLS GWILNTHLGR 

B_NL_3202A ERQRQIRSIS ERILSTYLGR 

B_TW_TWCYS ERQRQIRTIS GWILSNYLGR 

B_US_BC_LO ERQRQIRSIS ERILSTFLGR 

B_US_DH123 QRQRQIQSIS GWILSNHLGR 

B_US_JRCSF ERQRQIRTIS ERILSTYLGR 

B_US_MNCG_ ERQRHIRSIS AWILSNYLGR 

B_US_P896_ ERQRQIRSIS ERILGTFLGR 

B_US_RFJVI1 ERQRQIRRCS E W I LDTYLGR 

B_US_SF2_R ERQRQIRSIS GWILSTYLGR 

BJCJS_WEAU1 ERQRQIRKIS G W I LNTYLGR 

B_US_WR27_ . RQRQIQSLS AW 1 1 STHLGR 

B_US_YU2_M ERQRQIRSIS GWLLSNYLGR 

BF1_BR_93B ARQRQIREIS ERILSSCLGR 

C_BR_92BR0 ARQRQIHSIS ERILSTCVGR 

C_BW_96BW0 ARQRQIHSIS ERILSTCLGR 

CJBW_96BW1 ARQKQINSIS ERILSTCLGR 

C_BW_96BW1 ARQRQIHSIS ERILSTCLGR 

C_BW_96BW1 ARQRQIDSIS TRILSTCLGR 

C_ET_ETH22 ARQRQIHTLS ERILSNFLGR 

C_IN_93IN1 ARQRQIHSIS ERILSTCLGR 

C_IN_93IN9 ARQRQIHSLS ERILSACLGR 

C_IN_93IN9 ARQKQIHSLS ERILSTCLGR 

C_IN__94IN1 ARQRQIHSIS ERILSACLGR 

C_IN_95IN2 ARQRQIHSIS ERILSTFLGR 

CRF01_AE_C RRQRQIHSLS ERILVACVGR 

CRF01_AE_C ARQRQIHKIG ERILSTCLGR 

CRF01_AE_C ARQRQIRALS ERILSACLGR 

CRF01__AE_T ARQRQIRAIS ERILITCLGR 

CRF01__AE_T ARQRQIRAIS ERILNACVGR 

CRFO 1__AE JT ARQRQIRAIS ERILSTCLGR 

CRF01_AE_T ARQRQIREIS ERILSSCVGR 

CRFO 1_AE JT ARQRQISAIS ERILSTCLGR 

CRF01_AE_T ARQRQISAIS ERILSACLGR 

CRF02_AG_F ARQRQIRAIS ERFLSTCLGR 

CRF02_AG_F ARQRQIRAIS QRILSTCLGR 

CRF02_AG_G ARQRQIHSLS ERILSTCLGR 

CRF02_AG_N ARQRQIRAIS ERILSTCLGR 

CRF02_AGJS ARQRQIRAIS ERILSTCLGR 

CRF02_AG_S ARQRQVRAIS ERILSTCLGR 

CRF03_AB_R ERQRHIHSIS EQILSTYLGR 

CRF03_AB_R ERQRHIHSIS QRILSTYLGR 

CRF04_cpx_ ARQKQIHSLS ERILATYLGR 

CRF04_cpx_ ARQRQIHSIS ERVLATYLGR 

CRF04_cpx_ ARQNRIHSIS ER I LAACLGR 

CRF05JDFJB ARQRQINSIG ERLLS TYLGR 

CRF05_DF_B ARQRQIRSIA DRIVDTYLGR 

CRF06_cpx_ ARQNQIDSIS ERVLSTCLGR 

CRF06_cpx_ ARQNQIDSIS ERILSTCLGR 

CRF06_cpx_ ARQRQIDSIS ERILSTCLGR 

CRF06_cpx__ ARQNQIDSIS ERILSSCLGR 

CRFll_cpx_ ARQNQIDSIS QRILSDCLGR 

CRFll_cpx_ ARQNQLHSIS QRILSTCLGR 

D_CD_84ZR0 ARQRYIHSIG ERILSTYLGR 

D_CD__ELI_R ARQRQIREIA ERILGTYLGR 

D_CD_NDR_M ARQRQIHSIG ERIICTFLGR 

D_UG_94UG1 ARQRQIHSIG ERIISTYLGR 

F1_BE_VI85 ARQRQIRALS DRILSSCLGR 

F1JBR_93BR ARQRQIREIS DRILSSCLGR 

Fl_FI_FIN9 ARQRQIRAIS ERILSSCLGR 

Fl_FR_MP41 ARQRQIRSIS ERILVACLGR 



PAEPVPLQLP PLERLTLDCN EECGTSGTQG 
SAEPVPLQLP PLERLTLDCD EDCGTSGTQG 
PAEPVPLQLP PLERLTLDCD EDCGTSGTQG 
SAEPVPLQLP PLERLNLGCN EDCGTSGTQG 
PADAVPLQLP PLERLTLDCN EDCGTSGTQG 
PAEPVPLQLP PLERLTLDCN EDCGTSGTQG 
PAEPVPLQLP P . QRLTLDCS EDCGTSGTQG 
FEEPVPLPLP PLERLTLDCN EDCGTSGTQG 
SVDPVQLQLP PLERLTLDSS EDCGTSGTQG 
SAEPVPLQLP PLERLTLDCS EDCGNSGAQG 
PTEPVPLPLP PLDRLTLDCK EDCGTSGTQG 
PAEPVPLQLP PLERLTLDCS EDCGTSGTQG 
PTEPVPFQLP PLERLTLDCN EDCGTSGTQG 
PEEPVPLQLP PLERLHINCS EDCGQGTEEG 
PAEPVPFQLP PIERLNINCS ESGGTSGTQQ 
PTEPVPLQLP PIERLHIDCS ESSGASGTQQ 
SAEPVPFLLP PIERLHISDS ESGGTSGTQQ 
PAEPVPLQLP PIERLHIGGS ENSGTTGTQQ 
PEEPVPFQLP PIERLNIGDS ESGGTSGTQQ 
PAEPVPLQLP PLERLNLDCS EDSGTSGTQQ 
STEPVPLQLP PIERLHIGGS ESGGTSGTQQ 
PAEPVPLQLP PLERLHISGS ESGGTSGTQQ 
SAEPVPLQLP PLERLHISGS ESGGTSGTQQ 
PAEPVPLQLP PIERLHISGS ESGGTSGTQQ 
PAEPVPLQLP PIERLHISGS ESAGTSGTPQ 
STEPVPLQLP PLERLHIDCS EDCGTSGTQQ 
SPEPVPLQLP PLERLHLDCS EDCGTSGTQQ' 
SAEPVPLQLP PLERLHLDCS EDCGTSGTQQ 
STEPVPLQLP PLERLHLDCN EDCGTSGTQQ 
STEPVPLQLP PLERLHLDCS EDCGTSGTQQ 
STEPVPLQLP PLERLHLDCS EDCGTSGTQQ 
STEPVPLPLP PLERLHLDCS EDCGTSGTQQ 
STEPVPLQLP PVERLNLDCS EDGGTSGTQQ 
STEPVSLPLP PLERLHLDCS EDCGTSGTQQ 
SAEPVPLQLP PIERLCLDCS EGCGTSGTQQ 
SAEPVPLQLP PLERLCLDCS EGCGTSGTQQ 
PEEPVSFQLP PLERLNLDCS EDCGNSGTQS 
SAEPVPLQLP PIERLNLDCS EDCGTSGTQL 
SAEPVPLQLP PIERLRLDCS EDCGTSGTQG 
PAEPVPLPLP PIERLCLDCS EDSGTSGTQQ 
PEEPVLLHLP PLERLTLDCS EDCGTSGTQG 
PEEPVPLHLP PLERLTLDCS EDCGTSGTQG 
PAEPVPLQLP PLERLTLNCS EDCGTSGDRG 
PAEPVPLQLP PLERLTLNCS EDCGTSGEKG 
PAEPVPLQLP PIERLTLDCS EDCGTSGDRG 
SEEPVPLQLP PLERLNLNCS EDCGTSGTQG 
PEEPVPLQLP PLERLNLNCS EDCGTSGTQG 
SAEPVPLQLP PIERLRLDCS EDCGNSGTQG 
PTEPVPFQLP PIERLRLDCS EDCGNSGTQG 
SAEPVPLQLP PIERLRLDCS EDCGNSGTQG 
SEEPVPLQLP PIERLRLDCT EDCGNSGTQG 
SEEPVPLQLP PIERLHLDCS EDCGNPGTQG 
SEEPVPLPLP PIERLHLDCS EDCGNSGTQG 
SEEPVPLQLP PLERINLNCS EDCGTSGTQG 
PAEPVPLQLP PLERLNLNCS EDCRTSGTQG 
PEEPVPLQLP PLERLNLNCS EDCGTSGTQG 
FEEPVPLQLP PLERLNLNCS EDCGTSGTQG 
SEEPVPLQLP PLERLHINCS EDCGQGPEEG 
PAEPVPLQLP PLERLHINCS EDCGQGAEEG 
LEEPVPLQLP PLERLHINCS EDCGQGTEEG 
PEEPVPLQLP PLERLHINCS RDCGQGTNEG 
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F2_CM_MP25 
F2KU_BE_VI 
G_BE_DRCBL 
G_NG_92NG0 
G_J3E_SE616 
H_BE_VI991 
H_BE_VI997 
H_CF_90CF0 
J_SE_SE702 
J_SE_SE788 
K_CDJBQTB1 
K_CM_MP535 
N_CM_YBF30 
O_CM_ANT70 
Q_CM_MVP51 
0_SN_MP129 
O_SN_MP130 
U CD 83C 



00BW0762_1 
00BW0768_2 
00BW0 874_2 
00BW1471_2 
00BW1616_2 
00BW1686_8 
00BW1759_3 
00BW1773_2 
00BW1783_5 
00BW1795_6 
00BW1811_3 
00BW1859_5 
00BW1880_2 
00BW1921_1 
OOBW2036_1 
00BW2063_6 
00BW2087_2 
00BW2127_2 
00BW2276_7 
00BW3819_3 
00BW3 842_8 
00BW3871_3 
00BW3 876__9 
00BW3886_8 
00BW3891_6 
00BW3970_2 
00BW5031_1 
96BW01B21 
96BW0407 
96BW0502 
96BW06_J4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_26 
96BW17A09 
96BWM01_5 
96BWM03_2 
98BWMC12J2 
98BWMC13_4 
98BWMC14 a 



ARQRQIHQIS 
ARQRQIHSIS 
ARQRQIHSIS 
ARQRQIHSIS 
ARQRQISAIS 
ARQRQIHSIG 
ARQRQIRAIS 
ARQRQIREIS 
ARQNQIDSIS 
ARQNQIDSIS 
ARQRQIREIS 
ARQKQISSIS 
ARQRQIRAIS 
RRQAQVDTLA 
RRQAQVDS LA 
TRHAHVDTLA 
KRQAQIDTLA 
RRQQQIRSIS 



ERILSTCLGR 
QRILSTCLGR 
ERILSTCLGR 
ERILSACLGR 
ERILTAYLGR 
ERVLATCLGG 
ERILTDCLGR 
ERILTSCLGR 
ERILSSCLGR 
ERIPSSCLGR 
QRVLSSCLGR 
ERLLSACLGR 
ERILSSCLGG 
ARVLATWHG 
TRILATWHG 
ARILATWHG 
ARILATWHG 
ERILSTCLGR 



LQEPVRLQLP 
PAEPVPFQLP 
PEEPVPLQLP 
PAEPVPFQLP 
PAEPVPLQLP 
PAEPVPLQLP 
PPEPVPLQLP 
PPEPVTLQLP 
PAEPVPLQLP 
PAEPVPLQLP 
STEPVPLQLP 
SAEPVPLQLP 
PPEPVDLPLP 
PQNNNIVDLP 
SQDNNLVDLP 
PQDNNLVELP 
PQDNNLVELP 
PAEPVHLQLP 



LLEKLHINCS 
PLERLNLDCS 
PLERLHLDCS 
PLEGLSLDCS 
PLERLHLDCS 
PLERLTLDCS 
PLERLTLDCN 
PLERLTLNCS 
PIERLRLDCS 
PIERLRLDCS 
PLERLSLNCD 
PIEKLNLNCD 
PLDRLTLDTE 
PLEQLSIRDP 
PLEQLNIRDP 
PLEQLSIRDP 
PLEQLSIRDP 
PLERLNLDCS 



EDCGQGTEKG 
EDSREGAEGE 
EDGGTSGTQQ 
KDGGTSGTQQ 
EDSGTSGTQQ 
EDCGTSGEKG 
KDCGTSGEKG 
EDCGTSGEKG 
EDCGNSGTQG 
EDCGNSGTQG 
EDSGQGTEGE 
ED PGKGTEGG 
EDSGTPGTES 
EGDQLSEAWT 
EADRLPGTGT 
DGDQPSGTWT 
DGDQPSGTWT 
KGTATG 



101 129 

PQGTPEGMGN P 

SQGTSEGVGS P 

. S QGTTEGVGN P 

SQGITEGVGS P 

TQGVGS P 

S QGATEGVGN P 

vgs p: 

sqgttegvgs p 

S QGTTEGVGN P 

SQGTPEGVGN P 

SQGTPEGVGN P 

SQGTTEGVGS P 

SQGTPEGVGN P 

S QGTTEGVGN P 

SQGTTEGVGS P 

SQGTPEGVGN P 

PQGTTEGVGN P * 

VGS P 

SQGTTEGVGS P 

SQGTTEGVGS P 

PQGTTEGVGS P 

S QGTTEGVGN P 

SQGTKEGVGS P 

SQGTTEGVGS P 

SQGTTEGVGS P 

GVGH P 

PQGDTEGVGR P 

S QGTTEGVGN P 

S QGTTEGVGN P 

.... TEGVGS P 

SQGPTEGVGS P 

SQGTPEGVGN P 

SQGTTEGVGS P 

SQGTTEGVGS P 

GVGS P 

SQGATEGVGS P 

SQGTPEGVGN P 

SQGTTEGVGS S 

SQGTAEGVGS P 

SHGTPEGVGN P 

TQGVGN P 
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98BWMO!4_l SLGTTEGVGS P 

98BWM018_d SQGTTEGVGN P . 

98BWM036_a PQGTTEGVGN P ]]'.[[ 

98BWM037_d PQGTTEGVGS P . 

99BW3932_1 SQGTTEGVGS P 

99BW4642_ 4 SQGTTEGVGS P * 

99BW4745_8 SQGTTEGVGS P mm [ m 

99BW4754_7 SQGTPEGVGN S ] 

99BWMC16_8 SQGTTEGVGS P ][ 

A2_CD_97CD SQGAETGVGR PQTSVESSGI LGSGIEDX. 

A2_CY_94CY SQGTETGVGR SQESVESSVI LGSGTEEX. 

A2D 97KR PQGTETGVGR PQISVEPSW LGSGTEEX. 

A2G_CD_97C PQGTETGVGG .TIFVESSVI LGSRTKEQX 

A_BY_97BL0 SQXTETXVXX PQISXESSXI XXSGTKEX. 

A_KE_Q23_A SQGAETGVGR HQVSVESPVI LGSGTKNX. 

A_SE_SE659 SQGVETGVGR PQVSGESPVI LGSGTKNX . 

A__SE__SE725 SQGVETGVGR PQVPGEPSTV LGSGTKTX . 

A_SE_SE753 SQGIETGVGR PQVSVESPVI LGSGTKEX. 

A_SE_SE853 VGR PQVSVESPGV LDSGTKNX. 

A_J3E_SE889 SQGAETGVGG PQVSEESSII LGSGTKTX. 

A_SE_UGSE8 TQVSGESSW LDSGTKDX. 

A_UG_92UG0 SQGVETGVGR TQVSGESPW LGSGTKNX. 

A_JJG_U455_ PQGTETGVGG PQISVESSAV LGSGTKNX. 

AC_IN_2130 SQGVETGVGR PQVSVESPGI LGSGTKNX. 

AC_RW_92RW SQGTTEGVGN PVSRKSCAVL GSGTKKEX. 

AC_SE_SE94 SQGTETGVGR PQVSVESSAI LGPGTKNX. 

; ACb_SE_SE8 ...... .VGS NQISVESPAV LDSGTKEX. 

ACG_BE_VI1 VGS SQTSGEHPVI LESGTKEX. 

AD_SE_SE69 VGS PQIPVEPPAV LDSGTKEX. 

AD_SE_SE71 VGS PQIPVESPAI LDSGTENX. 

ADHK_NO_97 VGD PQIPGESSAV LGTGTKEX. 

ADK_CD_MAL VGS PQISVESPAI LGSGTEEX. 

AG_BE_VI11 SQGTETGVGR PQIFVESSGV LGSGTKEX. 

AG_NG_92NG SPGTETGVGG PQISVESPW LGSGTKEX. 

AGHU_GA_VI VGS PQISVESPTV LGTGAKEX. 

AGU_CD_Z32 VGD SQIPGESCDL LGSGTKEX. 

AJ_BW_BW21 ;VGD PQVSGESCPI LGEGTKEX. 

B_AU_VH_AF VGG PQVLVESPAV LESGAAEX. 

B_CN_RL42_ VGS PQILVESPAV LDSGTKEX. 

BJDE__D31_U VGS PQILVESPAV LESGTKEX. 

BJDE_HANJJ VGS PQVLVESPAV LEPGTKEX. 

B_FR_HXB2_ VGS PQILVESPTV LESGTKEX. 

B_GA_OYI_M VGS PEILVESPAV LEPGTKEX. 

B_GB_CAM1_ VGS PQILVESPAV LESGTKEX. 

B__GB__GB8_A VGS PQVLVESPAV LDPGTKEX. 

B_GBJ«ANC_ VGN PQVLVESPAV LESGSKEX. 

BJKR_WK_AF VGN PQILVESPAV LESGTKEX. 

B_NL_3202A VGS PQILVESPAV LESGTKEX. 

B_TW_TWCYS VGS PQIFVESPTV LDSGTKEX. 

B__US_BC_LO VGS PQVLVESPTV LEPGTKEX. 

B_US_DH123 VGT PQILVESPAV LESGTKEX. 

B_US_JRCSF VGN PEILVESPTV LESGTKEX. 

B_USJMNCG_ VGS PQILVESPTV LESGTKEX. 

B_US_P896_ VGS PQILVESPAI LEPGTKEX. 

B_JJSJRF_M1 VGS PQVLVESPAV LESGAKEX. 

B_USJSF2_K VGS PQILVESPAV LDSGTKEX. 

B_US_WEAU1 VGS SQILLESPAV LEPGTKEX. 

B_OS_WR27_ VGD PQILGESPTV LGSGAKEX. 

B_US_YU2_M VGS PQILVESPPV LDSGTKEX. 

BF1_BR_93B VGS PQTSGESRAV LESGTKEX. 

CJBR_92BR0 PQGNTERVGN PVFGRPCAVL ESRVKKEX. 
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C_BW_96BW0 SQGTTEGVGN PVSGKSCAIL GSRAKKEX. 

C_BW_96BW1 SQGTPEGVGN PISGKSCAVL GARAKKEX. 

C_BW_96BW1 SQGTTEGVGS PISGKSCAVL GSGTKKEX. 

C_BW_96BW1 SQGTTEGVGS PVSGKSCAVL GSGTKKEX ! 

C_ET_ETH22 SQGTTEGVGN PISGKPCAVL GSGAKKEX ! 

C_IN_93IN1 L..GS PISGKSCAVL GSGAKKEX. 

C_IN_93IN9 SQGTTERVGS PISGKSCAVL GSGAKKEX. 

C_IN_93IN9 SQGTTEGVGS PISGKSCAVL GYRAKKEX. 

C_IN_94IN1 SQGTTERVGS PISGKSCAVL GSGAKKEX. 

C_IN_95IN2 SQGTTEGVGS PISGKSCTVL GSGAEKEx! 

CRF01_AE_C SQGTETGVGG PQISGESSVI LGSGTKNX. 

CRF01_AE_C STGTETEVGR PQISGESSVI LGSGTKNX. 

CRF01_AE_C SRGTETGVGR PQISGESSVI LGSGTENX. 

CRF01_AE_T SQGTETGVGR PQISGESSVI LGPGTKNX. 

CRF01_AE_T SQGTETGVGR PQISGESSVI LGSGTKNX. 

CRF01_AEJT SQGTETGVGR PQISGESSVI LGPGTKNX. 

CRFO 1_AE JT SQGTETGVGR PQISGESPVI LGPGTKNX. 

CRF01_AE_T SQGTETGVGR PQISGESSVI LGPGTKNX. 

CRF01_AE_T SQGTETGVGR PQISVESSGI LGPGTKNX. 

CRF02_AG_F PQGTETGVGS PPISGESSTI LGSGTKEX 

CRF02_AG_F SQGTETGLGS PQISGESSDI LGAGTKEX ! 

CRF02_AG_G VAD PQIPGESRAI LGSGTKEX. 

CRF02_AG_N SQGTETGVGS PQISVESYII LGSGTKEX. 

CRF02_AG_S VGS PQISVESSIV LGSGTKEX ! 

CRF02_AG_S SQGTETGVGS SQTSVESSVI LGSGTKEX. 

CRF03_AB_R ...VGS PQILVESPTV LDSGTKEX . 

•CRF03_AB_R VGS PQILVESPTV LDSGTKEX. 

CRF04_cpx_ VGS PQVSVELPAV LGTGAKEX . 

,CRF04_cpx_ VGS PQVSVEPPAV LGTGAKEX. 

CRF04_cpx_ VGN PQVPVEPPAV LGTGDKEX. 

CRF05_DF_B VGS PQISVEPPAI LESGTKEX. 

.CRF05_DF_B VGS PQISVESPTV LESGAKEX . 

CRF06_cpx_ VGN PQISGEPDML LGTGTTEX. 

CRF06_cpx_ VGD PQIPGEPGW LGTGTKEX. 

CRF06_cpx_ VGD PQIPVEPGVL LGTGTKEX. 

CRF06_cpx_ VGD PQIPGEPGVL LGTGTKEX. 

CRFll_cpx_ VGD SQISGESDTV LGPRTEEX. 

CRFll_cpx_ VGE SQIPGESSTV LGPRTEEX 

D_CD_84ZR0 VGS PQISVESPAI LESRTEEX. 

D_CD_ELI_K VGH PQISVESPTV LESGTEEQX 

D_CD_NDK__M VGS PQIPVEPPAV LESGTEEX. 

D__UG_94UG1 VGS HQISVESPAV LDSGTKEX. 

F1_BE_VT85 VGS SQISGESHAV LESGTKEX 

F1JBR_93BR VGS SQISGESHTV LGSGTKEX. 

F1_FI_FIN9 VGS PQISGEHHTV LESGTKEX. 

F1_FRJVIP41 VGN PQISMEPRTV LESGTKEX. 

F2_CM_MP25 VGS PQISVESRAV LGSGTKEX. 

F2KU_BE_VI LGN PQIPVEPCAV LGSGTKEX 

G_BE_DRCBL SQGTEIGVGS PQIFVESSW LGSGTKEX 

G_NG_92NG0 PQGTETGVGR PQVLVEPPW LGSGTKEX 

G_SE_SE616 PQGTETGVGR .SIFVESSW LGQGTKEX 

H_BE_VI991 VGS PQTSGESPAV LGTGAKEX. 

H_BE_VI997 KGG PQIPVESSTV LGTGTKEX. 

H_CF_90CF0 EGS PQISLESSTI LGTGTKEX. 

J_SE_SE702 VGD PQISGEPCMV LGAGTKEX. 

0_SE_SE788 VGD PQISGEPCMV LGAGTKEX. 

K_CD_EQTB1 LGS PQIPVEPDTV LGSGDKEX. 

K_CM_MP535 LGS PQISVEPCTV LESGTKEX 

N_CM_YBF30 QQG . TATTET QNTLVGNTCI LGKRVKGX. 

O_CM_ANT70 VDPR.AEDNC LQNLCSCNTI LATRIAEX 

0_CM_MVP51 VDPG.TKDNS LT.LWSCNAI LATRIEKX . 
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0_SN_MP129 VDSG.TEDNC LQTLHSCNTI LATRVAEX. 

O_SN_MP13 0 VDPG . TEDNC LQNLHSCNTI LATRVAEX. 

U_CD 83C VGS TQIPGESCAV LGSGTKE. . 
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Table 16. HIV Tat Alignment 
GCG Multiple Sequence File. 
Written by Omiga 1.1 



Msmp • 

XMCXUtC* • 


nn'Rwfi'7 so n 

w UDWu / o« X 


Xjen : 


108 


XMdlllC • 


sj udviiu / o o £, 


Len : 


108 


Name * 


\J \J Din \J O / *i £ 


Len : 


108 


XMcLULC * 


nnwwi ATI o 


Len : 


108 


iMctmt; • 


KJ KJIj W JL D X D 


Len : 


108 


XMcLlllC • 


UUJDYvXOOO O 


Len: 


108 


Mamo • 
ivctlilc • 




Len : 


108 


N3.IH6 : 


u uon x / / j ,s 


Len: 


108 


Mamo • 
ivjdlllfc: • 


0"RTa7T *7 Q "7 C 
UUDI/yI / O j 3 


Len : 


108 


XMCllILG • 


UUDW1 / JO O 


Len : 


108 


Nstns : 


n nTJixn qi i o 


Len: 


108 


xM dint; . 




Len : 


108 


xvicirne : 


UUolNlOOU 2 


Len : 


108 




uu-owxy^x x 


Len: 


108 


NcLITlG l 




Len: 


108 


XMCltUC • 


UUJDWZUoj O 


Len : 


108 


N3TH6 I 


n fiTawo n q *7 o 
U UiD wz u o / z 


Len : 


108 


XNcLUlC » 


U \J£5 ri/C ±Z / Z 


Len : 


108 


"Warn fa • 


U UDnZZ / D / 


.Len : 


108 


Nairn© : 


U UDW J O X_7 J 


Jjen : 


108 


Name i 


UUDW j O rr Z O 


Len : 


108 


XNCLIUC • 


O fiT*W^ ft 71 O 


. Len : 


1.08 


Name I 




Len : 


108 


Name : 


UUDrV J OOD O 


Len : 


108 


Name Z 


n HRW? ft Q 1 C 


Len : 


108 


XNCLIUG • 


nflRW3 Q7fl 1 


Len : 


108 


Name * 


UUdViDUJI X 


Len : 


108 


XNctuicr . 


J70X5WU XJ5Z X 


Len: 


108 


IMCtUlC . 


jooWUrt U / 


Len: 


108 


XMclIIlC . 




Len: 


108 


XMCllUC . 




Len: 


108 


Name 1 




Len: 


108 


XMdlllC • 


jddWIz xu 


Len : 


108 


Mamp • 

XVCLILIG • 




Len: 


108 


XMdlllG • 


_? D JtJVv X D ZD 


Len : 


108 


XMCtlllC • 


Qfinwi 7 a n o 
" ddw i /Auy 


Len : 


108 


XVCLlllG • 


^7QI3¥Vl v iVJX O 


Len : 


108 


XVCLlttC . 




Len : 


108 


Name : 




juen : 


108 


Name : 




Len : 


108 


Name : 


— ' U i-J rV V\\ — X. *z Ct 


Len : 


108 


Name : 


9 8BWMD14, 1 

^ ODnriUX^c JL 


Xien : 


108 


Name • 

ILL >p* • 


98BWMOT R H 


Len : 


108 


Name : 


— ' uurii iw J vj el 


Len : 


X Uo 


Name : 


98BWM037*~H 


Len : 


XU o 


Name : 


99BW3932~1 


Len : 


108 


Name : 


99BW4642_4 


Len: 


108 


Name : 


99BW4745 8 


Len: 


108 


Name : 


99BW4754_7 


Len: 


108 


Name: 


99BWMC16_8 


Len: 


108 


Name : 


A2 CD 97 


Len: 


108 


Name : 


A2_CY 94 


Len: 


108 


Name : 


A2D 97 9 


Len: 


108 


Name : 


A2G_CD 9 


Len: 


108 


Name : 


A_BY_97_97 


Len: 


108 


Name: 


A_KE_93_Q2 


Len: 


108 


Name : 


A_SE_93_SE 


Len: 


108 
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Check : 


5728 


Weight 


: 1 


.00 


Check : 


4583 


Weight 


: 1 


.00 


Check : 


5462 


Weight 


: 1 


.00 


Check : 


4359 


Weight 


: 1 


.00 


Check : 


5389 


Weight 


: 1 


.00 


Check : 


6742 


Weight 


: 1 


.00 


Check : 


6187 


Weight 


: 1 


.00 


Check : 


5566 


Weight 


: 1 


.00 


Check : 


6579 


Weight 


: 1 


.00 


cnecK : 


6027 


Weight 
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GRKKRRPRR. 
GRKKRRSRR . 
GRKKRRQRR. 
GRKKRRQRR. 
GRKKRRQRR. 
GRKKRRQRR. 
GRKKRRQRR . 
GRKKWRQRR. 
GRKKRRQRR. 
GRKKRRQRR. 
GRKKRRQRR. 
GRKKRRPRR. 
GRKKRRQRH. 
GRKKRRQRP. 
GRKKRRQRH. 
GRKKRRQRR. 



•RAPQDNKNH QVSLSKQPTS RARGD . PTGQ EESKEKVEKE 
•RSPQDSETH QVSLSKQPAS QPRGD . PTGP KESKKKVERE 
'SS^S™ QADLSKQPTS QPRGD. QTGQ KESTKKVERE 
. RSPQHSQTD QASLSKQPAS QPRGD . PTGP KESKKKVETE 
.RPPQDSQTH QVSLSKQPSS QPRGD. PTGP KEQKKKVERE 
. GPPQGSQTH QVSLSKQPTS QPRGD. PTGP KESKEKVERE 
. RAPQDSQTH QASLSKQPAS QSRGD . PTGP TESKKKVERE 
.RAPEDSQTH QVSLPKQPAP QFRGD . PTGP KESKKKVERE 
. RPPQDSQTH QVSLPKQPSS QQRGD . PTGP KESKKKVERE 
.RPPQDSQTH QSSLSKQPTS QLRGD.PTGP TESKKKVERE 
. RAPQDSQTH QASLSKQPTS QPRGD . PTGP KESKKKVERE 
- RAPPEGLTH QVPLSKQPSS QFRGD. PTGP KESKKKWRE 
.RSPQNSQTH QDSLSKQPTS QPRGD . PTGP KESKKKVERE 
.RTPQSSQLH QDPVPKQPAS QAQGN . PTGP KESKKEVESQ 
.SAPPSSEDH QNPIPKQPLP . QTRGDQTGS EESKKKVESK 



.SAPPSSEDH QNPVSKQPLP 
.SAPPSSKDH QNPVSKQPLP 
.SAPPSSEDH QDLVPKQPLS 
. SAPPRSEDH QNLISKQPLP 
. RAPQSSKDH QNLISKQPLS 
.SAPPSSEDH QNLISKQPLP 



. QTRGDPTGL EESKKKVESK 
. QTRGDPTGS KESKKKVESK 
. QARGNPTSS KESKKKVESK 
. RTQGDSTGS EESKKKVESK 
. HTRGDPTGS EESKKKVESK 
. RTQGDPTGS EESKKKVESK 



■«* .^ivvjJJriLiij cJSSKieEeVESK 

•RAPQSSEDH QNLISKQPLP . RTQGDPTGS EESKKKVESK 
■SAPPSSEDH QNLISKQPLP . RTQGDPTGS EESKKKVESK 
i^ SEDH QDLISK Q PL P -RTQGDPTGS EESKKKVEGK 
. SAPQSSEDH QNPISKQPLP . RTPGDPTGS EESKKKVESK 
. GPPPGSKDH QNPIPKQPLP TTRG.NPTGP KESKKEVAKK 
. GPSQDSKDH QNSIPKQPLP TSRG . NPTGP KESKKKVESK 
•GTPQGSKGH QDPISKQPLP IIRG. NPTGP KESKKEVESK 
. GTPQSSKDH QNPIPKQPLP IIRR.NPTDP KESKKEVASK 
. GTPQSRKDH QHPIPEQPLS IIRG.NPTDP KESKKEVASK 
•GTPQSSKDH QSPIPEQPLP IIRG.NPTDP KESKKEVASK 
•GTPQSRKDH QYPIPEQPLP I IRGGNPTDP KESKKEVASK 
•GTPQSSKDH QTPIRKQPPS IIRG.NPTDP KESKKKVESK 
. RTPQSSKDH QYPIPEQPSP IIRG.IPTDP KESKKEVASK 
•GTPQSRQDH QNPVPKQPLP TTRG.DPTDP KESKKEVASK 
. GTPQSRQDR QNPVSKQPLP TTRG.NPTGP KESKREVESK 
. GTPQSHQDH QNPVSKQSLP QTRG.DPTGP KESKKEVESK 
•GTPQSRQDH QNPVPKQPLP TTRG.NPTDP KESKKEVESK 
. GTPQSRQDN QDPVPKQPLP TTRG.NPAGP KESKKEVAGK 
•GTPQSRQDH QNPVPKQPLP TTRG.EQTGP KESKKEVASK 
•RAPQDNQTD QVSLPKQPAS QPRGD. PTGP KE.KKKMERE 
•RPPQDNQTD QVSLPKQPAS QPRGD. PTGP KE.KKKVERE 
•GSLQGSKGH QNLIPKQPLS QQPNGDSTGP EEQKKKVASK 
• DLLGFSRDR QNPIPKQPLS Q.PNGNPEGP KEQKKKVASK 
•RPPQGSRDR QNPIPKQPLS QQHSGDPTGP KEQKEAVASK 
•RPPQGSQAH QDPVPEQPPS QPRGD. PTGP KKQKKEVESK 
•RPPQGGQAH QIPVPEQPSS QARGD.PTGQ KEQKKKVESK 
•QAPPGSKNH QDPVSKQPLS . QTQREQTGP EKSKKEVESK 
•TAPPGSKNH QDPVPKQPLS . QTQRGPTGP EKSKKKVESK 
•TAPPGSKNH QDPVPKQPLS .QTQRKSTGP EESKKEVESK 
•TAPLGSKSH QDPVPKQPLS . QTQRESTGP EKSKEEVESK 
•AASHSSENH QDPIPKQPST . QPNRKPTGP EESKKEVESK 
•TASRSSKNH QDPIPEQPLP . QASRNPTGP EEPKKEVESK 
•GPPQGGQAH QVPIPKQPSS QPRGD. PTGP KEQKKKVESE 
•KPPQGDQAH QVPIPEQPSS QSRGD. PTGP K.KKKKVESE 
. RPPHSSQTH QDPIPKQPSS QPRGD. PTGQ KEKKK. VESK 
•RTPPGGQAN QDPVPKQPSS QPRGN . PTGP KEKKK. VESE 
•RTPQSSQVH QNSLPKQPLS QARGD . PTGP KESKKEVESK 
•RTPQSSQIH QDPVPKQPIS QARGN . PTGP KESKKEVESK 
. RTPQSSQIH QDPVPKQPLS QPRRN . PTGP KESKKEVESK 
• RTPQSSQSH KNPIPEQPLS QARGD . PTGP KESKKEVESK 
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F2_CM_9 5_M GRKKRRQRR . 

F2KU_BE_94 GRKKRRQRR . 

G_BE__9 6__DR GRKKRKHRR . 

G_NG_92_92 GRKKRRPRR. 

G_SE_93_SE GRKKRKHRR . 

H_.BE VI 9 GRKKRRQRR . 

H_BE VI 9 GRKKRSRRR . 

H_CF_9 0__9 0 GRKKRSQRH . 

J_SE_93_SE GRKKRRQRR. 

J_SE_94_SE GRKKRRQRR. 

K_CD_9 7_EQ GREKRRQRT . 

K_CM_96JVIP GRKKRRPRR . 

N_CM_9 5_YB GRKKRSQRR . 

O CM ANT GRKK . . . RGR 

0_CM_91_MV GRKK. . .RRR 

0_SN 99S GRKK... RRR 

0_SN 9 9S GRKK . . . RRR 

U CD 8 3 C GRKKRGKRR . 

101 108 
00BW0762_1 TETDPFD . 
00BW0768_2 TKTDQFD. 
00BW0874__ 2 TKTDQFD. 
00BW1471_2 TEADPCD . 
00BW1616_2 TETDPFD . 
00BW1686_8 TKTDPFD . 
00BW1759_3 TETDRFD. 
00BW1773__2 TETDPD. . 
0 0BW1 7 8 3_5 TETDPFD . 
0 0BW1 7 9 5_6 TETDPFD . 
00BW1811_3 TETDPD. . 
0 OB Wl 8 5 9_5 TETDPYD . 
0 0BW1 8 8 0__2 TETNPFD . 
0 0BW192 1_1 TEADQFD . 
OOBW2036_JL TEADRFD. 
00BW2063_6 TETDPFD. 
0 0BW2 0 8 7_2 TERDPFD . 
0 0BW2 12 7_2 TTTDPFD . 
0 0BW2 2 7 6J7 TETDPYD . 
00BW3819_3 TKTDPFD. 
0 0BW3 8 4 2_8 TETDRFD . 
00BW3871_3 TKTDQFD. 
00BW3876_9 TKADPFD. 
00BW3886_8 AETDQFDY 
O0BW3 891_6 TETDPFA. 
00BW3970_2 TERDPFA. 
O0BW5O31_Jl TETDPFDW 
96BW01B21 TKTDPFD. 
96BW04 07 TEADPFD. 
96BW0502 TEADPFA. 
96BW06_J4 TETDQFD. 
9 6B Wl 1_0 6 TETDQFD . 

96BW1210 TETDPFD. 
96BW15B03 TETDRFD. 
96BW16_26 TETDPCD . 
9 6B Wl 7 AO 9 TEADPFD . 
9 6BWMO 1_5 TKTDQFD . 
9 6B WM03_2 TETDPFD . 
98BWMC12_2 TKAHPFD ! 
98BWMC13_4 TETDQFD. 
98BWMC14_a TDTDQFA. 



. RTPQSGEVH QDPVSKQPLS 
. RTPQSSQAH QNPISKQPLS 
. GTPHSSKDH QTPVPKQPFS 
. GTPQGSKDH QNPVPKQPLP 
. GTPQSSKGH QDPVPKQPLP 
. GTPKSLQDH QTLIPKQPLS 
. ATPAS VQDH QNHIPKQPLS 
. RTPASLQDH QNSISKQPLS 
. SAPPGSKTH QDIiIPKQPLS 
. SAPPGSKNH QDLIPEQPLF 
. TTPYASKNH KDPIPKQPLP 
.TTPYNSENH QDPLRKQPLS 
. RTPQSSKSH QDLIPEQPLS 
PAAAS.HPDH KDPVPKQSPT 
PAAAASYPDN KDPVPEQSLS 
PAAAARHPDN QDIVPEQLTY 
PAAAARNPDN QDIVPEQPPP 
. RTPQSGPNH QNIVSKQPSS 



QTRGD . PKGP, EESKKKVESK 
QARGD.PTGP KEPKKEVESK 
TTRG.NPTGP QESKKEVESK 
ITSG.NPTGS EKPKKEVASK 
TTRG.NPTGP KESKKEVASK 
. RTSGDPTGP EKKKK.VASK 
. RTRGDPTGP KEKKK.VASK 
. RTHGDPTGP KEQKKEVASK 
- QTQRKPTGP EESKKEVESK 
• QTQRKPTGP EESKKEVESK 
. QARGDPTGP KESKKEVESK 
. QPRGEQTDP KESKKKVESK 
- QQQGDQTGQ KKQKEALESK 
ITK.RKQERQ EEQEEEVEKK 
HTG.RKQKRQ EEQEKKVEKE 
ITN.RKQKRQ EEQEKEVENE 
ITNNRKHKRQ EEQEKEVEKE 
QPRGD . PTGQ EEPKKKVEKK 
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98BWM014_1 TETDPCA. 
98BWMO!8_d TETDQFD . 
98BWM036_a TETDPFD . 
9 8BWM03 7_d TETDPFD . 
99BW3932_1 TETDPFD. 
99BW4642_4 TETDQFA. 
9 9B W4 74 5_8 TEPDPCD . 
9 9BW4 7 5 4_7 TETDPFD . 
9 9BWMC1 6_8 TEADRFD . 
A2C D 97 AETDRFD . 

A2_CY 94 AETDRFD . 

A2D 97_9 AETDPCD. 

A2G_CD 9 TETDPD. 

A_BY_9 7_9 7 AETDQFD . 
A_KE_93_Q2 AEADRFD . 
A_SE_93_SE AETDRFD. 
A_SE_94_SE AEADRFD . 
A_SE_94_SE AETDRFD . 
A_SE_95_SE TEADRFD . 
A_SE_95_SE TETDRFA. 
A_SE_9 5__UG AETDRFA . 
A_UG_8 5_U4 AKTDRFA . 
A_UG_92_92 TEADRYA. 
AC_IN_95_2 AKTDRFD. 
AC__RW_9 2_9 TEADPFD . 
AC_SE_9 6_S TETDRFD . 
ACD_SE_9 5_ AETDRFD . 

ACG_BE V TETHPLA. 

AD_JSE_93_S AEADQFDW 
AD_SE_95_S TEPDRFD . 
ADHK_NO_97 TXTDPFDW 
ADK_CD__8 5_ AEADQFDW 

AG__BE VI TETHPGD . 

AG_NG_9 2_9 TETDQCA . 
AGHU G A AEADPFDW 
AGU_CD_76_ TETDPFAW 
Atf_BW_9 8JB AKPDRFD . 
B A U VH TETNPSD . 
B CN RL4 TETDPRD . 
B_DE_8 6_D3 TETDPID . 
B_DE_8 6_HA TEADPFD . 
B_FR_8 3_HX TETDPFD . 
B G A O YI TETDPED . 

B_GB CAM TETHPGD . 

B_GB GB8 TETDPSDW 

B_J3B__5 9_MA TETDPVA. 
B K R W K TWDPVT . 
B_NL_8 6_3 2 TETDPVD . 
B T W TWC TETDPNDQ 

B_XJS DH1 TETDPVH . 

B_US P 8 9 TETDPVH . 

B_US_83_RF TETDPAVQ 
B_US_8 3_SF TETDPFD . 
B_US_84_MN TETHPVD . 
B_US_8 6_JR TETDPDN . 
B _US_8 6_YU TETDPVH . 
B JJS_8 7_BC TETDPVD . 
B_US_8 8_WR TETDPIA . 
B_US_9 0_WE TETDPED . 
BF1 JBR_93_ AKTDPD . . 
C_BR_92_92 TETDPFD . 
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C_BW_96_96 
C_BW_96_96 
C_BW_96_96 
C_BW_96_96 
C_ET_86_ET 
C_IN_93_93 
C_IN_93_93 
C_IN_93_93 
C_IN_94_94 
C_IN_95_95 
CRF01_AE_C 
CRF01_AE_C 
CRF01_AE_C 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF01_AE_T 
CRF02_AG_F 
CRF02_AG_F 
CRF02_AG__G 
CRF02_AG_N 
CRF02_AG_S 
CRF02_AG_J3 
CRF03_AB_R 
CRF03_AB_R 
CRF04_cpx_ 
CRF04__cpx_ 
CRF04_cpx_ 
CRF05_DF_B 
CRF05_DF_B 
CRF06_cpx_ 
CRF06_cpx_ 
CRF06_cpx_ 
CRF06_cpx_ 
CRFll_cpx_ 
CRFll_cpx_ 
D__CD_83_EL 
D_CD_83_ND 
D_CD_84__84 
D_UG_94_94 
F1_BE_93_V 
F1_BR_93_9 
F1_FI__93_F 
F1_FR_96_M 
F2_CM_95_M 
F2KU_BE_94 
G_BE_96_DR 
G_NG_92_92 
G_SE_93_SE 

H_BE VI 9 

H_BE VI 9 

H_CF_90_90 
JJSE__93_SE 
J_SE_94_J3E 
K_CD_97_EQ 
K_CM_96JVIP 
N_CM_95_YB 
O C M A NT 
O CM 91 MV 



TETDPFD . 
TETDQFD . 
TETDPFD . 
TETDRFD . 
AETDPYA. 
TKTDPFD . 
AKTDPFA. 
TKTDPFA. 
TTSDPFD . 
TKTDPFD. 
AKTDPFA. 
AETDPDW. 
TKTDPCA. 
AETDQCD . 
AETDPCD . 
AETDPCD . 
AETDPCD . 
AETDPD . . 
AETDQCD . 
TETDQGD . 
TKTDPCD . 
TETDPFA . 
TKTDPCD . 
TETDPCD . 
TETGPCD . 
TETHPFD . 
TETHPFD . 
TEADPFA. 
TEADPFD . 
TESNPFD . 
TEADQFDW 
AETDPFDC 
AEPDRFD . 
AEPDRFD . 
AETDRFD . 
TEPDRFD . 
AEPDRFD. 
AEPAPFD. 
AETDPDC . 
AETDPFDW 
TEVHPFDW 
TEADPFDW 
AKTDPCA. 
AKTDPD. . 
AKTDPCD . 
TETDPFD . 
TKTDPSD . 
TETDPLD . 
TETDPFD . 
TETDPLD . 
AEADQCD. 
TETDPFDW 
TEADPCD. 
TETDPD. . 
AEPDRFD. 
AEPDRFD . 
TKTDPD . . 
TKTDQFD. 
TEADPCD. 
AGPGGYPR 
TGPSGQPC 
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0_SN 99S ACP . RYPG 

0_SN 9 9S TGSDRYPR 

U CD 83C TTTDPFD. 
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HIV V±f Se ^nca Alignment 
GCG Multiple Sequence Pile. 
Written by Omiga l.l 



Name : 
Name : 
Name : 
Name: 
Name : 
Name : 
Name : 
Name: 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name: 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name : 
Name: 
Name: 
Name : 
Name : 
Name : 
Name: 
Name : 
Name : 
Name: 
Name : 
Name: 
Name : 
Name : 
Name : 
Name : 
Name : 
Name: 
Name : 
Name : 
Name: 
Name : 
Name: 
Name : 
Name : 



00BW0762_1 
00BW0768_2 
00BW0874_2 
00BW14 71_2 
00BW1616_2 
00BW1686_8 
00BW1759_3 
00BW1773_2 
00BW1783_5 
00BW1795_6 
00BW1811_3 
0 0BW1859_5 
00BW1880_2 
0 0BW1921_1 
OOBW2 036_1 
00BW2063_6 
00BW2087_2 
00BW2127_2 
00BW2128_3 
00BW2276 7 
00BW3 819^3 
00BW3 842_8 
0"0BW3 8*71_3 
0 0BW3 8 7 6_9 
00BW3886_8 
00BW3 891_6 
O0BW3970_2 
00BW5031_1 
96BW01B21 
96BW0407 
96BW0502 
96BW06_CT4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_26 
96BW17A09 
96BWMO!_5 
96BWM03_2 
98BWMC12_2 
98BWMC13_4 
98BWMC14_a 
98BWM014_1 
98BWMO!8_d 
98BWM036_a 
98BWM03 7__d 
99BW3932_1 
99BW4642_4 
99BW4745_8 
99BW4754_7 
99BWMC16_8 
A2_CD_97CD 
A2_CY_94CY 

A2D 97KR 

A2G_CD_97C 
A_BY_97BL0 
A_KE_Q23_A 



Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len : 

Len: 

Len: 

Len : 

Len : 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len : 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 

Len: 



194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 
194 



Check : 
Check: 
Check: 
Check: 
Check: 
Check : 
Check : 
Check: 
Check : 
Check : 
Check : 
Check : 
Check : 
Check : 
Check : 
Check: 
Check: 
Check : 
Check : 
Check : 
Check: 
Check : 
Check : 
Check: 
Check : 
Check: 
Check: 
Check : 
Check : 
Check : 
Check: 
Check: 
Check: 
Check : 
Check : 
Check: 
Check : 
Check : 
Check : 
Check: 
Check : 
Check: 
Check: 
Check: 
Check: 
Check: 
Check: 
Check : 
Check : 
Check : 
Check: 
Check : 
Check : 
Check : 
Check : 
Check : 
Check: 



4675 
4961 
3755 
3843 
4613 
4096 
3523 
4446 
3151 
4892 
3877 
3290 
2555 
4284 
4019 
4165 
5068 
5231 
5469 
5547 
1251 
4197 
3487 
4432 
5175 
3845 
2268 
3711 
4602 
5108 
4385 
5371 
6037 
4343 
5690 
4471 
3907 
5608 
3079 
5336 
5304 
3984 
2480 
2801 
3762 
4971 
4165 
2912 
5323 
3964 
6325 
5849 
5097 
3871 
5705 
8467 
5053 



Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight ; 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 
Weight : 



1.00 
1. 00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1. 00 



1 

1 

1 

1. 

1. 

1. 

1. 

1. 



.00 
00 
00 
00 
00 
00 
00 
00 



1.00 
1.00 
1. 00 
1.00 
1. 00 
1.00 



1 
1 
1 
1 
1 
1 
1. 
1. 
1. 
1. 
1. 



.00 
.00 
. 00 
.00 
.00 
00 
00 
00 
00 
00 
00 



1.00 

1.00 

1.00 

1.00 

1.00 

1. 00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 

1.00 
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Name 


: A_SE_SE659 


Len 


Name 


: A_SE_SE725 


Len 


Name 


: A_SE_SE753 


Len 


Name 


: A_SE_SE853 


Len 


Name 


: A__SE_SE889 


Len 


Name 


: A_SE_UGSE8 


Len 


Name 


: A_UG_92UG0 


Len 


Name 


: A_UG_U455 


Len 


Name 


: AC__IN_2130 


Len 


Name 


: AC_RW_92RW 


Len 


Name 


: AC_JSE_JSE94 


Len 


Name 


: ACD_SE_JSE8 


Len: 


Name 


: ACG_BE__VT1 


Len: 


Name 


: AD_SE_SE69 


Len: 


Name 


AD_SE_JSE71 


Len: 


Name 


ADHK_NO_97 


Len: 


Name 


ADK_CD__MAL 


Len: 


Name : 


AG_BE_VI11 


Len: 


Name : 


AG_NG_9 2 NG 


Len: 


Name : 


AGHU_GA_VI 


Len: 


Name : 


AGU__CD_Z32 


Len: 


Name : 


AJ_BW_BW21 


Len: 


Name : 


B__AU_VH AF 


Len: 


Name : 


B_CN_RL42_ 


Len: 


Name : 


B_DE_D31_U 


Len: 


Name : 


B_DE_HAN_U 


Len: 


Name : 


B_FR_HXB2_ 


Len: 


Name : 


B__GA_OYI 


Len: 


Name : 


B_GB_CAM1__ 


Len: 


Name : 


B_GB_GB8_A 


Len: 


Name: 


B_GB_MANC_ 


Len: 


Name : 


B_KR_WK_AF 


Len: 


Name : 


BJSTL_3202A 


Len: 


Name : 


B_TW_TWCYS 
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KPPLPSVRKL TEDRWNKPQK TKGHRGSHTM NGH. 
KPPLPSVTKL TEDRWNKPQK TKGHRGSHTM NGH. 
KPPLPSVTKL TEDRWNKPQK TKGHRESHTM NGH. 
KPPLPSVRKL TEDRWNKPQK TKGHRGSHTM NGH 
KPPLPSVTKL TEDRWNKPQK TKGHRGSHTM NGH 
KPPLPSWKL TEDRWNKPQK TKGHRGSHTM HGH. 
KPPLPSVTKL TEDRWNKPQK TKGHRGSHTM NGH. 
KPPLPSVAKL TEDRWNKSHK TKGHRGSHTM NGH. 
KPPLPSVKKL TEDRWNKPQK TKGHRGSHTM NGH* 
KPPLPSVKKL TEDRWNKPQK TKGHRGSHT I NGH* 
KPPFPSVTKL TEDRWNKPQK TKGHRGSHTM TGH. 
KPPLPSVKKL TEDRWNKPQK TKGHRGSHTM NGH. 
KPPLPSVKKL TEDRWNKPQK TKGHRGSHTM NGH. 
KPPLPSVKKL TEDRWNKPQK TKGHRGSHTM NGH. 
KPPLPSVKKL TEDRWNXPQK TKGHRGSHTM NGH. 
KPPLPSVKKL TEDRWNKPQK TKGHRGSRTM NGH* 
KPPLPSVKKL TEDRWNKPQK TKDHRGSHTM NGH. 
KPPLPSVKKL VEDRWNKPQK TRDRRGNHTM NGH. 
KPPLPSVRKL VEDRWNEPQK TRGRRGNHTM NGH. 
KPPLPSVRKL VEDRWNKPQK TRGRRGNHTM NGH 
KPPLPSVRKL VEDRWNKPQK TRGRRGNHTM NGH. 
KPPLPSVRKL VEDRRNKPQK TRGRRGNRTM NGH 
KPPLPSVSKL VEDKWNKPQK TRGRRGNHTM NGH. 
KPPLPSIKKL VEDRWNNPQK IRGRRGNHTM NGH. 
KPPLPSIKKL VEDRWNNPQK IRGRRGNHTM NGH. 
KPPLPSVRKL VEDRWNNPLK TRGRRGNHTM NGH.* 
KPPLPSIKKL VEDRWNNPQK IRGRRGNHTM NGH* 
KPPLPSIKKL VEDRWNNPQK IRGRKGNHIM HGH. 
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CRF01_ 


AE 


C 


CRFOl] 


_AE C 


CRFOl] 


_AE_ 


C 


CRFOl] 


~ae] 


_T 


CRFOl] 


~ae] 


_T 


CRFOl] 


~ae~ 


_T 


CRF01_AE_ 


]t 


CRF01_ 


_AE_ 


"t 


CRFOl] 


~ae] 


"t 


CRF02] 




_F 


CRF02" 


"ag" 


"f 


CRF02__AG_ 


]g 


CRF02_AG] 


"n 


CRF02 AG~ 


]s 


CRF02_ 


_AG_ 


"s 


CRF03] 


_ab] 


~R 


CRF03* 


ab" 


~*R 



CRF04_cpx_ 
CRF04_cpx_ 
CRF04_cpx_ 
CRF05_DFJB 
CRF05_DF_B 
CRF06_cpx_ 
CRF06_cpx__ 
CRF06_cpx_ 
CRF06_cpx_ 
CRFll_cpx_* 
CRFll_cpx_ 
D_CD_84ZR0 
D_CD_ELI_K 
D_CD_JSTDK__M 
D_UG_94UG1 
F1_BE_VI85 
F1JBR_93BR 
F1_FI_FIN9 
F1_FR__MP41 
F2_CM_MP25 
F2KU_BE__VI 
G_BE_DRCBL 
G__NG_92NG0 
G_SE_JSE616 
H_BE_VT991 
H_BE_VI997 
H_CF_90CF0 
J_SE_SE702 
J_SE_SE788 
K_CD_EQTB1 
K_CM_MP535 
N_CM_YBF3 0 
O_CM_ANT70 
0_CM_MVP51 
0_SN_99SE_ 
0_SN_99SE_ 
U CD 83C 



LKALATPKKT RPPLPSVRKL 
LKALTKTKKT KPPLPSVRKL 
IiKALATPKKI RPPLPSVRKL 
LKALTTPKRI RPPLPSVKKL 
LKALTTPKRI KPPLPSVKKL 
LKALTTPKRI RPPLPSVKKL 
LKALTTPKRI RPPLPSVKKL 
LKALTTPKRI RPPLPSV.EI 
LKALTTPKRI KPPLPSVRKL 
LKALVTPAKT KPPLPSVKKL 
LKALVTPVKT KPPLPSVKKL 
LKALVTPTRK KPPLPSVRKL 
LNALVAPTKT KPPLPSVRKL 
LKALVTPTRT KPPLPSVKKL 
LKALVTPTRR KPPLPSVKKL 
LAALRTPKKI KPPLPSVTKL 
LAALRTPKKI KPPLPSVTKL 
LAALISPKKT KPPLPSVKKL 
LAALISPKKT KPPLPSVKKL 
LAALISPKKT KPPLPSVKKL 
LTALITPKKT KPPLPSVRKL 
LTALITPQKI KPPLPSVRKL 
LTALIKPEKR KPPLPSVQKL 
LTAL IKPKKR KPPLPSVQKL 
LTALIKPRKR KPPLPSVQKL 
LKALVKTKRR KPPLPSVQKL 
LKALVTPTRA KPPLPSVRKL 
LKALVTPKRT KPPLPSVRKL 
LTALIAPKKR KPPLPSVKKL 
LTALIAPKQI KPPLPSVRKL 
LAAL I APKKI KPPLPSVRKL 
LTALVTPRKI KPPLPSVGKL 
LTALIAPEKT KPPLPSVQKL 
LTALIAPKKT KPPLPSVQKL 
LTALVS PKKA KPPLPSVKKL 
LTALIAPKKT KPPLPSVKKL 
LTALITPKKI KPPLPSVRKL 
LTALVAPKKT KPPLLSVRKL 
LKVLVAPTRR RPPLPSVRKL 
SKALVTPTRK RPPLPSVGKL 
LKVLVTS KRS RPPLPSVTEL 
LTALISPKRT KPPLPSVRKL 
LTALVAPKKT KPPLPSVKKL 
LTALVAPKKI KPPLPSVRKL 
LTAL I KPKRR KPPLPSVQKL 
LTAL IRPKRR KPPLPSVQKL 
LTALIAPKKT KPPVPSVQKL 
LTALVAPRRP KPPVPSVKKL 
LTAWVGAKKR KPPLPSVTKL 
LRAWKARSR KPPLPSVQKL 
LKAWKVKRN KPPLPSVQRL 
LRVWKEKRN KPPLPSVQKL 
LRVWKEKRH KPPLPSVQKL 
LTTLVAPTKR KPPLPSVRKL 



TEDRWNKPQK TRGHRENPTM NGH. 
TEDRWNKPQK TKGHRES PTM NGH. 
TEDRWNKPQK TRGHRENPTM SGH. 
TEDRWNKPQK IWGHRENPTM NGH. 
TEDRWNKPQK IRDHREYRTM NGH. 
TEDRWNKPQK I KGHRENPTM NGH. 
TEDRWNKHQ . KGDHRENPTM NGH. 
TEDRWNKPQ. KRGHRENPTM NGH. 
TEDRWNE PQK IRGHREYPTM NGH. 
AEDRWNKPQK TRGHRGNRSM NGH. 
AEDRWNKPQK TRGHRGNRSM NGQ. 
AEDRWNEPQK TRGHRGSRPM NGR. 
AEDRWKEPQK TRGHRGSRPM NGH. 
AEDRWNEPQK TRGHRGSRSM NGH. 
AEDRWNEPQK TRGHRGNRSM NGH. 
TEDRWNKPQR TKDHRGSHTM SGH. 
TEDRWNKPQR TKDHRGSHTM SGH. 
VEDRWNKPQK TRGRRENQIM NGH. 
VEDRWNKPQK TRGRRENQIM NGH. 
VEDRWNKSQK TKGRRESHIM NGH. 
TEDRWNKPQK TKGRRGNHTM NGY. 
TEDRWNKPQR TKGHRGCHTM NGY. 
VEDRWNKPQK TRGHRESHTM NGH. 
VEDRWNKPQK TRDHRE SHTM NGH. 
VEDRWNKPQK TRDHRECHTM NGH. 
VEDRWNKPQK TKDHRE SHI M DGH. 
AEDRWNKPQK TRGHRGNHTA NGC . 
TEDRWNKPQK TRGRRGNHTV NGC. 
TEDRWNKPRQ TKGRRGSHTM NGH. 
TEDRWNKPQQ TRGHRGSHTM NGH. 
TEDRWNKPQK TKGRRGSHTM NGH. 
TEDRWNKPQR TKGHRGSHTM NGH. 
VEDRWNKPQE TRGHRGSHTM NGH. 
VEDRWNKPQK TRGHRESHTM NGH. 
VEDRWNKPQE IRGHRGSHTM NGH. 
VEDRWNKPQE TRGHRGSHTM NGH. 
VEDRWNNPQK TRGHRGSHTM NGH. 
VEDRWNKPQK TRDHRGSHTM NGH. 
TEDRWNEPQK TRGHRENPTM NGH. 
AEDRWNKPQK TRDHRENPTM NGH. 
AEDRWNKPQK TRGHRENPTM NGH. 
VEDRWNKPQK TRGHRGSHTM NGH. 
VEDGWNKPQK TRGHRGSHTM NRH. 
VEDRWNKPQK TRGHRGSHTM NGH. 
VEDRWNKPQK TRDHRESHTM NGH. 
VEDRWNKPQK TTGHRESHTM NGH. 
VEDRWNKPQK TRGHRGSHTM SGQ. 
VEDRWNKPQK TRGHRGS QTM NGH. 
TEDRWNEHQK MQGHRGNPIM NGH. 
TEDRWNKHLR IRDQLKSPSM NGH. 
TEDRWNKPWK IRDQLGSHSM NGH. 
TEDRWSRHLR IRDQLESHSM NGH. 
TEDRWSRHLR IRDQLGSHSM NGH. 
VEDRWNKPQK TKGHKGSHTM HGH. 
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Table 18. HIV Vpr Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga 1 . 1 



Name 


: 00BW0762_1 


lien: 


100 


Check : 


8179 


Name 


: 00BW0768_2 


lien: 


100 


Check : 


8119 


Name 


: OOBW0874 2 


Len: 


100 


Check : 


7661 


Name 


: 00BW1471 2 


Lem 


100 


Check : 


6614 


Name 


: 00BW1616_2 


Len: 


100 


Check : 


6361 


Name 


: 00BW1686_8 


Len: 


100 


Check: 


6014 


Name 


: 00BW1759_3 


Len: 


100 


Check : 


6894 


Name 


: 00BW1773 2 


Len: 


100 


Check : 


7772 


Name 


: 00BW1783_5 


Len: 


100 


Check : 


7149 


Name 


: 00BW1795_6 


Len: 


100 


Check : 


7614 


Name 


: 00BW1811_3 


Len: 


100 


Check: 


7968 


Name 


: 00BW1859 5 


Len: 


100 


Check : 0 6222 
Check: 6941 


Name 


: 00BW1880_2 


Len: 


100 


Name 


: 00BW1921 1 


Len: 


100 




fll ft** 
O -Loo 


Name 


: 00BW2036 1 


Len: 


100 


VxXXCwJV • 


pi TC 


Name 


: 00BW2063 6 


Len: 


100 


Choreic . 


O / U O 


Name 


: 00BW2087 2 


Len: 


100 


Check : 


70QO 

/ o o o 


Name 


: 00BW2127_2 


Len: 


100 






Name 


: O0BW2128 3 


Len: 


100 




JL / .SO 


Name 


: O0BW2276_7 


Len: 


100 


Check : 


f\ ft 


Name 


: 00BW3819_3 


Len: 


100 


Cht^flc • 


3 O / KJ 


Name 


: 00BW3842_8 


Len: 


100 


Check : 


77ft ft 
/too 


Name 


: 00BW3 871_3 


Len: 


100 


Check • 


8574 


Name 


: 00BW3876 9 


Len: 


100 


Check : 


79 ft 


Name 


: 00BW3886_8 


Len: 


100 


Check • 


6446 


Name 


: 00BW3891_6 


Len: 


100 


Check : 


8629 


Name 


: 00BW3 970_2 


Len : 


100 


Check : 


7113 


Name 


: 00BW5031_1 


Len: 


100 


Check : 


5511 


Name 


: 96BW01B21 


Len : 


100 


Check : 


7551 


Name 


96BW0407 


Len: 


100 


Check: 


8226 


Name : 


96BW0502 


Len: 


100 


Check : 


8242 


Name : 


96BW06 J4 


Len : 


100 


Check : 


7544 


Name : 


96BW11_06 


Len: 


100 


Check : 


7942 


Name : 


96BW1210 


Len : 


100 


Check : 


8580 


Name : 


96BW15B03 


Len : 


100 


Check : 


7308 


Name : 


96BW16_26 


Len: 


100 


Check : 


7009 


Name: 


96BW17A09 


Len : 


100 


Check: 


6492 


Name : 


96BWM01_5 


Len : 


100 


Check : 


5837 


Name : 


96BWM03 2 


Len: 


100 


Check : 


5277 


Name : 


98BWMC12_2 


Len: 


100 


Check: 


7807 


Name : 


98BWMC13 4 


Len: 


100 


Check : 


9051 


Name: 


98BWMC14 a 


Len: 


100 


Check : 


7867 


Name : 


98BWM014_1 


Len: 


100 


Check: 


7266 . 


Name : 


98BWM018_d 


Len: 


100 


Check : 


7638 


Name: 


98BWM03 6__a 


Len : 


100 


Check : 


7495 


Name : 


98BWM037_d 


Len: 


100 


Check: 


6640 


Name : 


99BW3932 1 


Len: 


100 


Check : 


6974 


Name: 


99BW4642_4 


Len: 


100 


Check : 


6081 


Name : 


99BW4745_8 


Len : 


100 


Check : 


8860 


Name : 


99BW4754 7 


Len : 


100 


Check : 


6856 


Name : 


99BWMC16 8 


Len: 


100 


Check : 


8223 





• inn 




. 1. UU 


Weight 


inn 




. inn 

JL . U U 


nciyuL 


• inn 
X . UU 




. i n ft 
X . u u 


weignu 


. T ft f\ 

: 1.00 


WclyllL 


" 1.00 


T/Jf* "i rrVi t* 

vs c J.yn u 


• 1 ft ft 

: l . 00 




. i ft ft 
1 . 0 u 


»c Juy ii L- 


• T ft ft 

: 1.00 




• T ft r\ 

: 1.00 


ncxyxxu 


• "1 ft ft 

1.00 


vic_>yiit 


. 1 ft ft 

: 1.00 


ncxy IX u. 


• inn 
. 1 . uu 


wcxyiit, 


• 1 ft ft 

: 1.00 


Wp>"i rr"h f- 

riCXy XX L. 


• inn 
X . UU 


Weiahh 

vt c j_y ix i_ 


• inn 
1 . OO 


Weiaht- 

mcxyxiL 


• inn 
x . u u 




• inn 

* 1 • u u 


Weight 


i n n 

• l.UU 


Weight 


i n n 
. 1 . uu 


Weight 


i n n 
J. . uu 


Weight 


i nn 
* .l • u u 


Weight 


i n n 


We_* n rrVi 4- 
ivcxyiiL 


i n n 
► l.UU 


Weight 


i n n 
• 1 . uu 


Weiahi- 


i n n 
_L . u u 


micxyii l. 


i ft ft 
1 • 00 


TXTo "J rrVi 

mcxyiiL < 


t ft ft 
1.00 


TaTp* "J #-t"H ¥• « 

VVC JLyilL- « 


T ft ft 

1.00 


XaSc^ -i rrVi +- • 
vYcxyiiL • 


1 ft ft 

1 . 00 


TaJo -i rrVi 1- • 
rvcxyiiL . 


1 ft ft 

1.00 


W^l rrh-h • 


i n n 
1.00 


Weierht- ■ 


i n n 

X • UU 


Mp *i rrVi t- • 
rvcxyxiu . 


T ft ft 

1.00 


vvc: xyiiL . 


T ft ft 

1 • 00 


i^icxyiiL • 


i n n 
l.UU 


W^i eiht* • 


i n n 
l.UU 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 


Weight : 


1.00 
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00BW0762_1 
00BW0768_2 
00BW0874_2 
00BW1471J2 
00BW1616_2 
00BW1686_8 
00BW1759_3 
00BW1773_2 
0OBW1783_5 
00BW1795_6 
00BW1811_3 
00BW1859_5 
OOBW1880_2 
00BW1921_1 
00BW2036_1 
00BW2 063_6 
00BW2 087_2 
00BW2127_2 
00BW212 8_3 
00BW2276_7 
00BW3819_3 
00BW3842_8 
00BW3871_3 
00BW3876_9 
00BW3886_8 
00BW3891_6 
00BW3970_2 
'00BW5031__1 
96BW01B21 
96BW0407 
96BW0502 
96BW06_J4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_2 6 
96BW17A09 
96BWMG1_5 
96BWM03_2 
98BWMC12_2 
98BWMC13_4 
98BWMC14_a 
98BWM014_1 
98BWM018_d 
98BWM036_a 
98BWM037_d 
99BW3932__1 
99BW4642_4 
99BW4745_8 
99BW4754_7 
99BWMC16 8 



MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQPPEDQGP 
MEQPPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQPPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQTPEDQGP 
MEQAPEDQGP 
MEQVPEDQGP 
MEQVPEDQGP 
MEQSPEDQGP 
MEQFPEDQGP 
MEQPPEDQGP 
MEQPPEDQGP 
MEQAPEDQGP 
MERPPEDQGP 
MERAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
TEQAPEDQGP 
MEQPPEDQGP 
MEQTPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQAPEDQGP 
MEQPPEDQGP 
MEQPPEDQGP 
MEQAPENQGP 
MEQAPEDQGP 



QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYN.ETL 
QRVPYNEWAL 
QREPYNEWAL 
QRELYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QRGPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QREPYTEWAL 
QREPHNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWTL 
QREPYNEWAL 
QREPYNEWTL 
QREPYNEWAL 
QREPYNEWTL 



ELLEELKQEA 
EILEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELIQEA 
ELLEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
GLLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
ELLEELKQEA 
EILEELKQEA 
EILEELKQGA 
EILEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EVLEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
EILEELKQEA 
ELLEELKQEA 
EILEELKQEA 
ALLEDLKQEA 
ELLEELKQEA 
ELLEELKQEA 
EILEELKQEA 
EILEELKQEA 
EILEELKQEA 
EVLEDLKQEA 
ELLEELKQEA 
ELLEELKQEA 



VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRIWLH 
VRHFPRPWLH 
VRHFPRLWLH 
ARHFPSSWLH 
VRHFPRTWLH 
VRHFPRPWLQ 
VRHFPRLWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRTWLH 
VRHFPRPWLH 
VRHFPRPWLQ 
VRHFPRPWLH 
VRHFPRPWLH 
VKHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRMWLH 
VRHFPGPWLH 
VRHFPPPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPR.TLH 
IRHFPIPYLQ 
VRHFPRPWLH 
VRHFPRIWLH 
VRHLPRPWLH 
VRHVPRPWLH 
VRHFPRPWLH 
VRHFP ITWLH 
VRHFLRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 
VRHFPRPWLH 



50 

SLGQHIYNTY 
NLGEYIYETY 
SLGQYIYETY 
SLGQHIYETY 
SLGQYIYENY 
SIGQYIYETY 
GLGQHIYETY 
SLGQYIYETY 
SMGQHIYNTY 
NLGQYIYNTY 
GLGQYVYETY 
SLGQYIYETY 
GLGQHIYNTY 
NLGQYIYQTY 
SLGQYIYETY 
NLGQYIYNTY 
NLGQYIYETY 
NLGQYIYETY 
GLGQYIYETY 
SLGQYIYDTY 
NLGQHIYETY 
GLGHYI YET Y 
NLGQYIYETY 
GIGQYIYETY 
NLGQHIYETY 
SLGQYVYETY 
SLGQHIYETY 
SLGQHIYETY 
GLGQYIYETY 
GLGQYIYETY 
GLGQYVYETY 
SLGQYIYETY 
SLGQHIYNTY 
SLGQYIYETY 
SLGQYIYETY 
GLGQYIYDTY 
SLGQHIYETY 
DLGQHIYNTY 
HLGQYIYETY 
SLGQYIYETY 
NLGQYVYNTY 
SLGQHIYETY 
SLGQHIYETY 
SLGQYIYETY 
NLGQYIYETY 
DLGQYIYETY 
NLGQYIYATY 
NLGQYIYETY 
SIGQYVYSTY 
DLGQHIYNTY 
SLGLYIYETY 



00BW0762_1 
00BW0768_2 
00BW0874__2 
00BW1471_2 
0 0BW1616_2 
00BW1686_8 
00BW1759 3 



51 

GDTWTGVEAI 
GDTWTGVEAL 
GDTWTGVETI 
GDTWAGVEAL 
GDTWAGVEAI 
GDTWTGVEAL 
GDTWTGVEAI 



IRILQQLLFI 
IRVLQQLLFI 
IRTLQQLLFI 
LRILQQLLFI 
TRILQQLLFI 
MRILQQLLFI 
IRILQQLLFI 



HFRIGCQHSR 
HFRIGCSHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HYRIGCQHSR 



IGIMRQ. 
IGIVRQ . 
IGILRQ . 
IGIIPQ. 
IGILRQ. 
IGILQR . 
IGIVRQ. 



100 

. - RRTRNGASRS 

. RRARNGSSRS 

. KRARNGASRS 

. RRARNGSRRS 

. RRARNGANRS 

. R.ARNGASRS 

. RRARNGANRS 
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00BW1773_2 
00BW1783_5 
00BW1795_6 
00BW1811_3 
00BW1859_5 
00BW188O_2 
.00BW1921_1 
00BW203 6_1 
00BW2063_6 
00BW2087_2 
O0BW2127_2 
00BW212 8_3 
00BW2276_7 
00BW3819_3 
00BW3842_8 
00BW3 871_3 
O0BW3876_9 
O0BW3886_8 
00BW3891_6 
00BW397 0_2 
O0BW5031_l 
96BW01B21 
96BW0407 
96BW0502 
96BW06_J4 
96BW11_06 
96BW1210 
96BW15B03 
96BW16_26 
96BW17A09 
96BWMO!_5 
96BWM03_2 
98BWMC12_2 
98BWMC13_4 
98BWMC14__a 
98BWM014_1 
98BWM018_d 
98BWM036_a 
98BWM037_d 
99BW3932_1 
99BW4642_4 
99BW4745_8 
99BW4754_J7 
99BWMC16 8 



GDTWTGVEAI 
GDTWAGVEAI 
GDTWTGVEAI 
GDTWTGVEAI 
GDTWAGVEAL 
GDTWTGVEVL 
GDTWTGVEAL 
GDTWTGVEAI 
GDTWTGVEAI 
GDTWTGVEAL 
GDTWTGVEVI 
GDTWAGVESL 
GDTWAGVEAI 
GDTWAGVEAL 
GDTWTG VET I 
GDTWTGVEAL 
GDTWTGVEAI 
GDTWTGVEAI 
GDTWTGVEAL 
GDTWTGVEAL 
GDTWMGVEAL 
GDTWTG VENM 
GDTWTGVEAL 
GDTWTGVETL 
GDTWTGVET I 
GDTWTGVEAI 
GDTWTGVEVL 
GDTWTGVEAI 
GDTWTGVE I K 
GDTWAGVEAL 
GDTWTGVEAI 
GDTWAGVLAI 
GDTWTGVEAI 
GDTWTGVEAI 
GDTWTGVEAI 
GDTWTGVEAI 
GDTWTGVEVI 
GDTWTGVEAL 
GDTWTGVET I 
GDTWMGVEAL 
GDTWAGVEAI 
GDTWTGVEAL 
GDTWTGVEAI 
GDTWTGVEVI 



IKILQQLLFI 
IRILQQLLFI 
IRTLQQLLFV 
IRILQQLLFV 
IRILQQLLFI 
IRILQQLLFI 
IRILQQPLFI 
IRILQQLLFI 
IRILQQLLFI 
IRILQQLLFT 
IRILQQLLFI 
IRMLQHLLFI 
IRILQQLLFT 
LRILQQLLFI 
IRILQQLLFI 
LRVLQQLLFV 
IRILQQLLFI 
IRMLQQLLFI 
IRMLQQLLFI 
IRILQQLLFI 

IRILQ 

IRILQQLLFV 
IRTLQQLLFI 
IRILQQLLFI 
IRILQQLLFI 
IRILQQLLFI 
TRILQQLLFI 
IRILQQLLFI 
IRILQQLLFI 
LRILQQLLFI 
TRILQQLLFI 
IRILQQLLFI 
LRILQQLLFI 
IRILQQLLFI 
IRILQQLLFI 
IRILQQLLFI 
IRILQQLLFI 
IRTLQQLLFI 
IRVLQQLLFI 
LRILQQLLFI 
IRVLQQLLFI 
MRILQQLLFI 
IRILQQLLFI 
IRILQQLLFI 



HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRSGCAHSR 
HFRIGCQHSR 
HYRFGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCHHSR 
HFRIGCQHSR 
HFRIGCSRSR 
HFRIGCQHSR 
HYRIGCAHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCLHSR 
HFRIGCQHSR 
HFRIGCHHSR 
HYRIGCQHSR 
HFRIGCSHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCQHSR 
HFRIGCH . SR 
HFRIGCQHSR 
HFRIGCHHSR 
HFRIGCRHSR 
HFRIGCHHSR 
HFRIGCQHSR 



IGILRQ . . 
IGILRQ . . 
IGIMRQ. . 
IGILQQ . . 
IGILQQ . . 
IGIIRQ. . 
IGITLP . . 
IGTLPQ . . 
IGIIRQ. . 
IGILQQ. . 
IGILRQ . . 

IDX 

IGILRQ . . 
IGILRQ . . 
IGPMRQ . . 
IGILQQ. . 
IGIVRQ . . 
IGILRQ . . 
IGILRR. . 
IGIILQ. . 
IGIILQ. . 
IGILQQ. . 
IGILRQ . . 
IGILRQ. . 
IGILQQ . . 
IGIIRQ. . 
IGILRQ . . 
IGIMRQ. . 
IGILQQ . . 
IGITPQ . . 
IGIMRQ . . 
IGIWR. . . 
IGILRQ . . 
IGILRQ . . 
IGILPR . . 
IGILRQ . . 
IGILRQ . . 
IGILRQ. . 
IGIVRQ . . 
IGILRQ . . 
IGIMQQ . . 
IGILRQ. . 
IGIIRQ. . 
IGILRQ . . 



RRARNGASRS 
RRTRNGASRS 
RRARNGTSGS 
RRARNGASRS 
RRARNGASRS 
RRTRNGASRP 
RRARNGANRS 
RRARNGASRS 
RRTRNGD SRS 
RRARNGANRS 
RRTRNGASRS 

RRARNGASRS 
RRARNGASRP 
RRARNGASRS 
RRARNGS SRS 
RRARNGANRS 
RRARNGANRS 
RRARNGASRS 
RRTRNGASRS 
RRTRNGASRS 
RRARNGASRS 
RRVRNGTNRS 
RRTRNGASRS 
RRARNGASRP 
RRTRNGASRP 
RRTRNGASRS* 
RRARNGASRS 
RRARNGARRS 
RRARNGSRRS 
RRARNGASRS 
RRARNGASRS 
RRARNGASRS 
RRTRNGASRS 
RRARNGS SRS 
RRARNGANRS 
RRARNGANRS 
RRARNGASRS 
RRARNGASRS 
RRARNGASRS 
RRARNGASRS 
RGARNGASRS 
RRTRNGASRP 
RRARNGPSRS 
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Table 19. HIV Vpu Sequence 
GCG Multiple Sequence Pile. 
Written by Omiga 1.1 



Name : 


00BW0762_1 


Len 


Name : 


00BW0768_2 


Len 


Name : 


O0BWO874_2 


Len 


Name : 


00BW1471 2 


Len 


Name : 


00BW1616 2 


Len 


Name: 


00BW1686 8 


Len 


Name : 


00BW1759_3 


Len 


Name : 


00BW1773 2 


Len 


Name : 


00BW1783 5 


Len 


Name : 


00BW1795 6 


Len 


Name: 


00BW1811_3 


Len 


Name : 


00BW1859__5 


Len 


Name : 


00BW1880 2 


Len 


Name : 


00BW1921 1 


Len 


Name : 


00BW2036 1 


Len 


Name : 


00BW2063 6 


Len 


Name : 


00BW2087 2 


Len 


Name : 


00BW2127_2 


Len 


Name : 


00BW2276 7 


Len 


Name : 


00BW3819_3 


Len 


Name: 


00BW3 842_8 


Len 


Name: 


00BW3871_3 


Len 


Name : 


00BW3 876_9 


"Len 


Name : 


O0BW3 886_8 


Len 


Name : 


00BW3 891_6 


Len 


Name : 


00BW3970_2 


Len 


Name: 


00BW5031_1 


Len< 


Name : 


96BW01B21 


Len: 


Name : 


96BW0407 


Len: 


Name : 


96BW0502 


Len: 


Name : 


96BW06_J4 


Len: 


Name : 


96BW11_06 


Len: 


Name: 


96BW1210 


Len: 


Name : 


96BW15B03 


Len : 


Name : 


96BW16_2 6 


Len: 


Name: 


96BW17A09 


Len: 


Name : 


96BWM01 5 


Len: 


Name : 


96BWM03_2 


Len: 


Name : 


98BWMC12_2 


Len : 


Name : 


98BWMC13_4 


Len: 


Name : 


98BWMC14 a 


Len: 


Name : 


98BWM014_1 


Len: 


Name : 


98BWM018_d 


Len: 


Name: 


98BWM036_a 


Len: 


Name : 


98BWM037_d 


Len: 


Name : 


99BW3932 1 


Len: 


Name : 


99BW4642_4 


Len: 


Name : 


99BW4745__8 


Len: 


Name : 


99BW4754_7 


Len: 


Name: 


99BWMC16_8 


Len: 


Name : 


A2_CD_97CD 


Len: 


Name : 


A2_CY_94CY 


Len: 


Name: 


A2D 97KR 


Len: 


Name: 


A2G__CD_97C 


Len: 


Name: 


A_BY_97BL0 


Len: 


Name : 


A_KE__Q23__A 


Len: 


Name : 


A_SE_SE659 


Len: 



106 


Check: 


4772 


Weight 


i on 


106 


Check: 


7115 


Weight 


• 1.00 


106 


Check: 


7209 


Weight 


: 1 . 00 


106 


Check : 


7341 


Weight 


: 1.00 


106 


Check : 


3870 


Weight 


: 1.00 


106 


Check : 


8787 


Weight 


1 on 


106 


Check: 


. 7584 


Weight 


i nn 

* x . u u 


106 


Check : 


7507 


Weiaht 


i nn 


106 


Check: 


7874 


Weight 


i nn 


106 


Check: 


8721 


Weight 


1 nn 

• X . W U 


106 


Check: 


3657 


Weight 


T nn 


106 


Check : 


6137 


Weight 


1 nn 


106 


Check: 


5827 


Weight 


i nn 


106 


Check : 


4605 


Weight 


T nn 


106 


Check: 


6145 


Weight 


1 nn 


106 


Check : 


7187 


Weight 


i nn 


106 


Check : 


9545 


Weight 


: 1.00 


106 


Check : 


4898 


Weight 


1 on 


106 


Check : 


7311 


Weight 


i 1.00 


106 


Check : 


4879 


Weight 


: 1.00 


106 


Check: 


1804 


Weight 


: 1.00 


106 


Check : 


6650 


Weight 


: 1.00 


106 


Check : 


6684 


Weight 


: 1.00 


106 


Check: 


8701 


Weight 


: 1 on 


106 


Check: 


8544 


Weight 


: 1.00 


106 


Check : 


9375 


Weight 


i nn 


106 


Check : 


7778 


Weight 


l nn 


106 


Check: 


6481 


Weight 


1 00 


106 


Check : 


4225 


Weight 


1 on 


106 


Check: 


5292 


Weight 


i n n 


106 


Check : 


5367 


Weiaht 


i nn 


106 


Check: 


6477 


Weight : 


i nn 


106 


Check: 


6400 


Weight : 


i n n 


106 


Check : 


2981 


Weiaht • 


T nn 

X * \J KJ 


106 


Check : 


5352 


Weight : 


i n n 


106 


Check : 


6778 


Weight : 


i no 

X . U \J 


106 


Check : 


5954 


Weight : 


i nn 

X • u u 


106 


Check : 


6334 


Weight : 


1 . 00 


106 


Check : 


6905 


Weight : 


1 00 


106 


Check : 


7458 


Weight : 


1 . 00 


106 


Check : 


4023 


Weight : 


1.00 


106 


Check : 


5708 


Weight : 


1 . 00 


106 


Check : 


7741 


Weight : 


1 . 00 


106 


Check: 


5445 


Weight : 


1 . 00 


106 


Check : 


8225 


Weight : 


1 . 00 


106 


Check : 


1112 


Weight : 


1.00 


106 


Check: 


8891 


Weight : 


1.00 


106 


Check: 


3424 


Weight : 


1.00 


106 


Check: 


5468 


Weight: 


1.00 


106 


Check: 


6656 


Weight : 


1.00 


106 


Check : 


6086 


Weight : 


1.00 


106 


Check: 


4609 


Weight : 


1.00 


106 


Check : 


2120 


Weight : 


1.00 


106 


Check: 


4405 


Weight : 


1.00 


106 


Check : 


913 Weight: 


1.00 


106 


Check: 


3380 


Weight : 


1.00 


106 


Check : 


4192 


Weight : 


1.00 
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Name: A_SE_SE725 

Name: A_SEJSE753 

Name: A_SE_SE853 

Name: A_SE_SE889 

Name: A_SE_UGSE8 

Name: A_UG_92UG0 

Name: A_UG_U455__ 

Name: AC_IN_2130 

Name: AC_RW_92RW 

Name: AC_SE_jSE94 

Name: ACD_SE_SE8 

Name: ACGJBE_VI1 

Name: AD_SE_SE69 

Name : AD_SE_SE7 1 

Name: ADHK__NO_97 

Name: ADK_CD_MAL 

Name: AG_BE_VI11 

Name: AG_NG_92NG 

Name: AGHU_GA_VI 

Name: AGU_CD_Z32 

Name: AJ BW BW21 



Name : 


B_ 


_AU 


_VH_AF 


Name : 




~CN^ 


JRL42 


Name : 


b~ 


~de~ 


"D31_U 


Name : 


B DE 


"han U 


Name : 




_FR_HXB2_ 


Name : 


b" 


_GA_OYI M 


Name : 


b] 


_GB_ 


_CAM1_ 


Name : 


b" 




~GB8_A 


Name : 


B~ 




_MANC 


Name : 


b] 


~kr~ 


_WK_AF 


Name : 


b[ 


~nl~ 


~3202A 


Name : 


b[ 


~tw" 


JTWCYS 


Name : 


B US 


~bc_lo 


Name : 




_us_ 


_DH123 


Name : 


b" 


"us" 


"jRCSF 


Name : 


b~ 


"us" 


"mncg_ 


Name : 


b~ 


jjsI 


~P896_ 


Name : 


b" 


"us" 


RF Ml 


Name : 


b" 


_US_SF2 K 


Name : 


b" 


_US_WEAU1 


Name : 


b" 


JJSJWR27 


Name: 


B™ 


_US_ 


_YU2_M 



Name: BF1_BR_93B 
Name : C_JBR_92BR0 
Name : C_BW_9 6BW0 
Name: C_BW_96BW1 
Name : C JBW_9 6B Wl 
Name : CJBW_96BW1 
Name : C_ETJ3TH2 2 
Name : C_IN_9 3 INI 
Name : C_IN_93 IN9 
Name : C_IN_9 3 IN9 
Name : C_IN_9 4 INI 
Name : C_IN_9 5 IN2 
Name : CRF 0 1_AE_C 
Name : CRFO 1_AE_C 
Name : CRF01_AE_C 
Name : CRFO 1__AE_T 
Name : CRFO 1_AE_T 
Name: CRFO 1_AE T 



Len 


: 106 


Cneck : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


; 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check • 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check: 


lien 


: 106 


Check: 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


; 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check: 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check : 


Len 


: 106 


Check: 


Len 


106 


Check : 


Len 


: 106 


Check : 


Len 


106 


Check : 


Len, 


106 


Check : 


Len: 


106 


Check : 


Len. 


106 


Check : 


Len: 


106 


Check: 


Len: 


106 


Check: 


Len: 


106 


Check : 


Len: 


106 


Check : 


Len: 


106 


Check: 


Len: 


106 


Check: 


Len: 


106 


Check: 


Len : 


106 


Check : 


Len: 


106 


Check: 


Len: 


106 


Check : 


Len: 


106 


Check : 


Len: 


106 


Check: 


Len: 


106 


Check : 


Len: 


106 


Check : 


Len: 


106 


Check: 



3244 


Weight 


: 1 


. 00 


1918 


Weight 


: 1 


. 00 


5495 


Weight 


1 


. 00 


5422 


Weight 


: 1 


. 00 


4254 


Weight 


: 1 


. 00 


4081 


Weight 


: 1 


. 00 


2987 


Weight 


: 1 


.00 


7929 


Weight 


: 1 


.00 


5133 


Weight 


: 1 


. 00 


7394 


Weight 


: 1 


. 00 


1852 


Weight 


: 1 


. 00 


6357 


Weight 


: 1 


. 00 


5734 


Weight 


: 1 


. 00 


4697 


Weight 


: 1 


. 00 


6301 


Weight 


: 1 


. 00 


4338 


Weight 


: 1 


. 00 


3500 


Weight 


: 1 


. 00 


4859 


Weight 


: 1 


. 00 


6173 


Weight 


: 1 


. 00 


9411 


Weight 


: 1 


. 00 


6158 


Weight 


1 


. 00 


5007 


Weight 


: 1 


.00 


4249 


Weight 


: 1 


. 00 


4572 


Weight 


: 1 


. 00 


6819 


Weight 


: 1 


. 00 


5240 


Weight 


: 1 


. 00 


5651 


Weight 


: 1 


. 00 


5359 


Weight 


: 1 


.00 


1955 


Weight 


: 1 


. 00 


6521 


Weight 


: 1 


.00 


2320 


Weight 


: 1 


.00 


4510 


Weight 


: 1 


. 00 


5491 


Weight 


: 1 


. 00 


3142 


Weight 


: 1 


. 00 


4669 


Weight 


: 1 


. 00 


4070 


Weight 


i 1 


. 00 


3291 


Weight 


: 1 


.00 


2280 


Weight 


: 1 


. 00 


3104 


Weight 


1 


. 00 


3857 


Weight 


1 


. 00 


3817 


Weight : 


1 


. 00 


3329 


Weight : 


1 


.00 


5184 


Weight : 


1 


. 00 


3243 


Weight : 


1 


. 00 


7645 


Weight : 


1 


, 00 


5235 


Weight : 


1 


.00 


6477 


Weight : 


1 


00 


6400 


Weight : 


1 


00 


2981 


Weight : 


1 


00 


8303 


Weight : 


1, 


00 


8376 


Weight : 


1. 


00 


6231 


Weight : 


1. 


00 


7626 


Weight : 


1. 


00 


6889 


Weight : 


1. 


00 


8199 


Weight : 


1. 


00 


4437 


Weight : 


1. 


00 


4082 


Weight : 


1. 


00 


3725 


Weight : 


1. 


00 


3201 


Weight : 


1. 


00 


3137 


Weight : 


1. 


00 


3484 


Weight : 


1. 


00 
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Name : 


CRF01_AE_T 


Len : 


106 


Check : 


Name : 


CRF01_AE_T 


Len: 


106 


Check : 


Name : 


CRF01_AE_JT 


Len: 


106 


Check : 


Name : 


CRF02_AG_F 


Len: 


106 


Check: 


Name : 


CRF02_AG_F 


Len: 


106 


Check : 


Name : 


CRF02_AG_G 


Len: 


106 


Check: 


Name : 


CRF02_AG_ 1 N 


Len: 


106 


Check: 


Name : 


CRF02_AG_JS 


Len: 


106 


Check : 


Name : 


CRF02_AG_S 


Len: 


106 


Check : 


Name : 


CRF03_AB_R 


Len: 


106 


Check : 


Name : 


CRF03__AB_R 


Len: 


106 


Check: 


Name : 


CRF04_cpx_ 


Len: 


106 


Check: 


Name : 


CRF04_cpx_ 


Len: 


106 


Check: 


Name : 


CRF04__cpx_ 


Len : 


106 


Check: 


Name : 


CRF05_DFJB 


Len: 


106 


Check : 


Name : 


CRF05_DF_B 


Len: 


106 


Check : 


Name : 


CRF06__cpx_ 


Len: 


106 


Check: 


Name : 


CRF06_cpx_ 


Len: 


106 


Check: 


Name : 


CRF06_cpx_ 


Len: 


106 


Check: 


Name: 


CRF06_cpx_ 


Len: 


106 


Check: 


Name : 


CRFll_cpx_ 


Len: 


106 


Check: 


Name : 


CRFll_cpx_ 


Len: 


106 


Check : 


Name : 


D_CD_84ZR0 


Len: 


106 


Check: 


Name : 


D_CD_ELI_K 


Len: 


106 


Check : 


Name: 


D_CD_NDK_M 


Len: 


106 


Check : 


Name : 


D_UG_94UG1 


Len: 


106 


Check : 


Name : 


F1_BE_VI85 


Len: 


106 


Check: 


Name : 


F1_BR_93BR 


Len: 


106 


Check: 


Name : 


F1_FI_FIN9 


Len: 


106 


Check : 


Name : 


F1_FR_MP41 


Len: 


106 


Check: 


Name : 


F2__CM_MP2 5 


Len: 


106 


Check: 


Name: 


F2KU_BE_VI 


Len: 


106 


Check: 


Name : 


G_BE__DRCBL 


Len: 


106 


Check : 


Name : 


G_NG_92NG0 


Len: 


106 


Check : 


Name : 


G_SE_SE616 


Len: 


106 


Check : 


Name: 


H_BE_VI991 


Len: 


106 


Check : 


Name : 


H_BE_VI997 


Len: 


106 


Check : 


Name: 


H_CF_90CF0 


Len: 


106 


Check : 


Name : 


J_SE_SE702 


Len: 


106 


Check: 


Name : 


J_SE_SE788 


Len: 


106 


Check: 


Name : 


K_CD_EQTB1 


Len: 


106 


Check : 


Name : 


K__CM_ MPS 3 5 


Len: 


106 


Check : 


Name : 


N_CM_JYBF30 


Len: 


106 


Check : 


Name: 


O_CM__ANT70 


Len: 


106 


Check : 


Name : 


0_CM_MVP51 


Len: 


106 


Check: 


Name : 


0_SN_99SE_ 


Len: 


106 


Check: 


Name : 


0_SN_99SE_ 


Len: 


106 


Check : 


Name : 


U_CD 83C 


Len: 


106 


Check : 



3491 

2300 

2481 

2748 

4618 

400 

5979 

5296 

4213 

952 

431 

6986 

8606 

5826 

5193 

5092 

3214 

3831 

4862 

3114 

4492 

9106 

5829 

2669 

4007 

2098 

4208 

4525 

5556 

4332 

5691 

4047 

3839 

825 

4456 

2728 

3468 

5568 

4413 

3659 

4999 

4729 

8961 

7210 

8490 

726 

9615 

6063 



Weight 
Weight 
Weight 
Weight 
Weight 
Weight : 
Weight 
Weight 
Weight 
Weight : 
Weight : 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight ; 
Weight : 
Weight : 
Weight : 
Weight : 
Weight: 
Weight : 
Weight ; 
Weight : 
Weight : 
Weight : 
Weight: 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight 
Weight : 
Weight : 
Weight: 



1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1. 00 
1. 00 



1. 
1. 
1. 
1. 
1. 
1, 
1. 



00 
00 
00 
00 
00 
00 
00 



1.00 
1.00 
1. 00 
1. 00 

i.bo 

1.00 
1.00 
1.00 
1. 00 
1. 00 
1. 00 
1.00 
1.00 



1, 
1, 
1. 
1, 
1. 
1. 
1. 
1. 
1. 



00 
00 
00 
00 
00 
00 
00 
00 
00 



1 

00BW0762_1 MI ELIAAVDYRI 

00BW0768_2 ML ELTARVDYRL 

00BW0874_2 ML GLSEKAGYAL 

00BW1471_2 MI NLLERVDX. . 

00BW1616_2 MLDLDYKV 

00BW1686_8 ...MEDVILS FIA.KIDYRI 

0 0BW1759_3 MID LSA.RVDYRI 

00BW1773_2 MLKLATIVDY ILAAKVDYRV 

00BW1783_5 . . .IVDVIFS LTD . RVDYRI 

00BW1795_6 MVD WTKXKVDYRI 

00BW1811_3 MLAIDYSL 



GVA 
GVG, 
GVG. 
GVG. 
AVG. 
GIA. 
GVA. 
GIG. 
AVA. 
AW. 
GVG. 



ALIIAL 
ALIVAI 
ALIVAL 
ALGIAL 
AFTVAL 
AIIVAL 
AFIIAL 
ALIAAL 
ALTIAL 
AFIVAL 
ALLVAL 



IIAIWWTIA 
ILAIWWIWA 
IIVIWWTIV 
IIVIWWTIV 
ILAIWWILV 
ILAIIVWTIV 
IIAIWWTIV 
IITIWWIIV 
IIAIWWTIV 
IIAIWWTIV 
FIAIID 



1.00 
1.00 
1.00 
: 1.00 

50 

YIEYRKLLKQ 
YIEYKELLRQ 
YIEYRKLVRQ 
YIEYRKLVRQ 
YREYRKLLRQ 
YLEYRKLVRQ 
YIEYRKLLRQ 
YREYRKLLRQ 
YIEYRKLVRQ 
YIEYRKLRKQ 
RKLVRQ 
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00BW1859_5 ML ELTARVDYRL 

00BW1880_2 MLS LMT.RVDYRI 

00BW1921_1 MLD LAA.IVDYRI 

00BW2036JL ML DLIAKVDYRV 

00BW2 063_6 MID WTE.QVDYRI 

00BW2 087_2 ML SLIERIDYRL 

00BW2127_2 ML DLAARVDYRL 

00BW2276_7 M VDLGRVDYRL 

00BW3 819_3 ML DLLTRVDYRL 

00BW3 842_8 MF DLLAGVDYRL 

00BW3871_3 MVD LLE . KVDYRI 

00BW3876_9 M LDLTQIGYEL 

00BW3 886_8 MLAAVDYRL 

00BW3891_6 ML DLLAGVDYRI 

OOBW3970_2 MF SLLERIDYRL 

00BW5 031_1 M FALFEVDYRL 

96BW01B21 ML ELIAKIDYRL 

96BW0407 ML SLAA.IDYRI 

96BW05 02 MI NFLAKVDYRL 

96BW06_J4 MID LLA.RVDYRI 

96BW11_06 MVD LLA.KVDYKI 

96BW1210 ML YLLEKVDYRL 

96BW15B03 MVN FLAAKVDYRL 

96BW16__2 6 MF SLLAKVDYRV 

96BW17A09 ML NLLAKVDYRL 

96BWM01_5 MID WTA.RVDYRV 

96BWM03_2 .ML SLAALVDYRL 

98BWMC12_2 MV VLGEKEIYIL 

98BWMC13_4 MSN WTA.RVDYRI 

98BWMC14_a MLD LTE.GVDYRI 

98BWMO!4_l ML ELLEKVDYKI 

98BWM018_d ML DLIARVDYRL 

98BWM036_a ML AFLARVDYRL 

98BWM037_d ML NLVARVDYRL 

9 9BW3 9 3 2_1 MLENIDYRL 

99BW4642_4 ML DLTARVDYRL 

99BW4745_8 MI NLIARVDYRL 

99BW4754__7 ml . ELDYRI 

99BWMC16_8 MF NLAARVDYRL 

A2_CD_97CD . . . .MSPLAILS 

A2_CY_94CY MLPLVILA 

A2D 97KR MNSLQILS 

A2G_CD_9 7C MQALEISA 

A_BY_97BL0 MTPLXIYA 

A_KE_Q2 3_A MSPLEISA 

A_SE_SE659 MQSLVILA 

A_JSE_SE725 MLPLSTIWA 

A_SE_SE753 MIPLQIWA 

A_JSE_SE853 MSA LEISALEIWS 

A_SE_JSE8 89 MTPLEIWA 

AJSE_UGSE8 MSPLEIWA 

A_UG_92UG0 MQLLEICA 

A_UG_U4 5 5_ MTPLEIWA 

AC_IN_2 130 MLNLNYEL 

AC_RW_92RW MTSLEIYA 

AC_SE_JSE94 ML DLLAEIDYRI 

ACD_SE_SE8 MQPLEICS 

ACG_BE_VI1 ML ELLAQVDYRI 

AD_SE_SE69 MQSLVILS 

AD_SE_SE7 1 MLSLLHICA 

ADHKNO_97 MYILGI 



GVG.ALIVAL IIAIIVWTIA SLEYRKLKRQ 
AVA.AFVIAL ILAIIVWTIA YLEYRKLVKQ 
TIV.AFAIAL FIAIIVWTIA YLEYRKLVRQ 
GIG . ALIVAL IIAVWWIIA YIEYRKLLKQ 
AIYXSFIVAL IIAIVIWTLA YIEYRKLSRQ 
GVG.ALIVAL IIVIIVWTIV YIEYRKLVRQ 
GVG.ALWAL IIAIIVWTIV YIEYRKLVRQ 
GVG.ALIVAL IIAIWWIIV YIEYRKLVRQ 
GVG.ALIVAL IIAIIVWTIA YIEYRKLLRQ 
GVG.ALIIAL IIAIWWVIA YIEYKKWLKQ 
GIA.AFTVAL LIAIIVWIIA YIEYRKLVRQ 
GIG. ALIVAL IIAIWWTIV YIEYRKVLRQ 
GVG . ALTVAL ILVIIVWTIV YIEYRKLVRQ 
GVG.AFLVAL SIAIWWTIV YIEYRKLLRQ 
GVG. ALLVAL IIAIWWAIV YIEYRKLVRQ 
TIG . AFIVAL FLAIWWT I A YLEYRKLVRQ 
GGG. ALIVAL SIAIWWIIA YIEYKKLIRQ 
GVG.AFWAL IIAIIVWIIV YIEYRKLVRQ 
GVG . ALIVAF IIAIWWIIA YIEYRKLLRQ 
GLA.AFWAL LIAIIVWTIV YLEYRKLVRQ 
AVA.AFIIAL IIAIWWIIV YVEYRKLVKQ 
GVG.ALIIAL IIAIIVWTIA YLEYRKVLRQ 
GVG.ALIVAL IIAIWWSIV YIEYKKLLKQ 
GVG.ALIVAL IIAIIVWIIV YIEYRKLLRQ 
GVG . ALVIAL IIAIWWIIA YIEYRKLVRQ 
AW. AFIVAL ILAIIVWIIA YLEYRKLSKQ 
GVG.ALIVAL IIAIIVWTIV YLEYRKLVRQ 
GIG. ALIVAL IIAIWWSIA YIEYRKLVRQ 
AIA. AFIVAL IIAIWWTIV YIEYRKLSRQ 
GIA.AFAVAL FIAIIVWIIV YLEYRKIVKQ 
TVA. AFIVAL IIAIWWTIA YIEYRKLLRQ 
GVG.ALIVAL IIAIIVWTIV YIEYRKLVRQ 
GVG. AFIIAL IIAIIVWTIA YLEYRKLVRQ 
GVG.ALIVAL IIAIWWTLV YIEYRKLVRQ 
GVG.ALIVAL IIVIIVWTIV YIEYRKLVRQ 
GVG.ALIVAL IIAIWWTIV YLEYRKLQRQ 
GVG.ALIIAL IIAIWWTIA YIEYRKLLKQ 
GIA. AFIIAL IIAIWWTIV YIEYRKLRRQ 
GVG . ALVIAG IIAIWWTIV YIEYRKLSRQ 
IVG. .LWAS ILAIWWTW FIEYRKIKKQ 
IVG..LIVAL ILAIWWTIV FIEYKKIKKQ 
IVG . . LIVAF ILAVWWTIV FIEYKKIKKQ 
IVG..LIVAF IAATWWTIV YIEYRKIRKQ 
IVA. .LVWF VIXXWWTIV G IX YXXLLKQ 
IVG . . LIVAL ILAIAVWTIV GIEYKKIRKQ 
IVA..LVWL IIAIWWTIV FIEWGRLRRQ 
I IG.. LIVAL ILAIWWTIV GIEYKKLLKQ 
IAG.. LIVAL ILAIIAWTIV GIEYEKLLKQ 
IVG . . LWAL ILAIWWTIV GIECKRLQKQ 
IVG..LVWS IIAIWWTIV GIEYKKLLKQ 
IVG.. LIVAL ILAIWWTIV GLEYKKLKRQ 
WG. .LWAL IIAIWWTIV GIEYKKLLKQ 
ITG.. LIVAL ILAIWWTIV GIE.KKLLKQ 
AVG. ALIVAL IIAIWWTIV YVEYRKLLRQ 
IVA. .LIVAL IIVIWWTLA GIEYKKLLKQ 
GVG. AFIIAL IIAVWWIIA YIEYRKLVRQ 
IVG..LIVAI ILAIVAWTIV GIEIKKLLRQ 
GVG . ALILAL FIAIWWIIA YLEYRKLVRQ 
IVA. .LWAL VIAIWWTIV FIEYRRLERQ 
IVG. . LLVAL IIAIWWTIV GIEYKRLLRQ 
GIG. ALIVAF IIVIWWTIV YLEYRKLVKQ 
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ADK_CD_MAL IQPLVILA IVA. .LWTL IIAIWWTIV FIEIRKIRRQ 

AG_BE_VI11 MQSLEIAA WG. .IiIVAF IAAIIVWTIV FIEYRRIRKQ 

AG_NG_92NG MQSLEIAA IAG. .LWAA IAAIWWTI KIKKQ 

AGHU_GA_VI MYILGI GIG . ALWAF IIAIWWTIA YIEYRKLLRQ 

AGU_CD_Z32 MQFLEIAS IVG. .LIVAV LAAIWW. . . . IEYRKIRKE 

AJ_BW_BW21 MRPLQIAA IVG. .VIVAI FLAIWWTIV YIEYKRLVRQ 

B_AU_VH_AF MQSLQIVA IVA. .LVWT IIAIWWTIV FIEYRRILRQ 

B_CN_RL42_ MQALTILA IVA. .LVWA IIAIWWTIV LLEYRKILRQ 

B_DE_D31_U MQSLQVAA IVA. .LVWA IIAIWWTIV FIEYRKILRQ 

B__DE_HAN_U MLSLQIAA IVA. .LWAA IIVIWWTIV VIEYRKLLRQ 

B_FR_HXB2_ TQPIPIVA IVA. .LWAI IIAIWWSIV I IEYRKILRQ 

B_GA_OYI_M IYSLQILA IVA. .LVWT IIAIWWTIV LLEYRKILRQ 

B_GB_CAM1_ MLPLQIA IVA. .LVWA IIAIWWTIV FIEYRKIRRQ 

B_GB_GB8_A IQILT IVA. .LWAG IVAIWWIIV FIEYRKILKK 

B_GB_MANC_ IVSLQIVA IVA. .LVWT IIAIWWTIV FIEYRKILRQ 

B_KR_WK_AF MQLLAILA IVG. .LWAA ILAIWWFIV FIEYKKILKQ 

B_NL_3202A MQPLWAA IVA. .LVWA IIAIWWSIV FIEYRKILRQ 

B_TW_TWCYS MQPLHILA IVA. .LWAA IIAIWWTIV LLEYRKILRQ 

B_US_BC_L0 MQPIQIAA IVA . . LVWG I IAI WW KILRQ 

B_US_DH123 MQPLVILA IVA. .LWAL IIVIWWSIV L IEYRKILRQ 

B_US_JRCSF MQPLQILA IVA. .LWAG IIAIIVWSIV LIEYRKILRQ 

B_US_MNCG_ MQPLVIAA IVA. .LWAG IIAIWS.IV FIEYRKIRRQ 

B_US_P896_ MLSLQILA IVA. .LWAA IIAIWWSIV FIEYRKILRQ 

B_US_RF_M1 MQSLEILA IVA. .LWAA ILAIWWTIV GIEIRKTLRQ 

B_US_SF2_K MQSLQILA IVS..LVWA IIAIWWTIV LIEYRKILRQ 

B_US_WEAU1 MQSLQILA IVA. .LWAG IIAIWWSIV LIEYRKILRQ 

B_US_WR27_ .. .MPLYILA WA. .LVLAA IIAIWWTIV FIEYRKILRQ 

B_US_YU2_M LQSLQVLA IVA. . LWAT IIAIWWTIV FIEYRKILRQ 

BF1_BR_93B MSYLLV IGLAALIAAL IIAIWWTIA YIEYRELVRQ 

C_BR_92BR0 ML ELIGRIDYRL GVG.ALIVAL IIVIIVWTIA YIEYRKLVRQ 

C_BW_96BW0 MI SLAARVDYRI GVG . AFIVAL IIAIIVWTIV YIEYRKLVRQ 

C_BW_96BW1 MVD LLA.KVDYKI AVA.AFIIAL IIAIWWIIV YVEYRKLVKQ 

C_BW_96BW1 ML YLLEKVDYRL GVG. ALII AL IIAIIVWTIA YLEYRKVLRQ 

C_BW_96BW1 MVN FLAAKVDYRL GVG.ALIVAL IIAIWWSIV YIEYKKLLKQ 

C_ET_ETH2 2 MV DLLAKVDYRI VIV. AFIVAL IIAIWWTIA YIEYRKLLRQ 

C_IN_93IN1 MIDLYRL GVG.ALIVAL IIAIWWTIV YIEYRKLVRQ 

C_IN_93IN9 MLDLDYKL AVG.ALIVAL IIAIWWIIA YIEYRKLVRQ 

C_IN_93IN9 IIDLDYRL GVG . ALIVAF IIAIWWTIV YIKYRKLLRQ 

C_IN_94IN1 MLDLKL AVG.ALIVAL IIAIWWTIV YIEYRRLVKQ 

C_IN_95IN2 MVNLDYKL GVG.ALIVAL IIAIWWTIV YIEYRKLVQQ 

CRF01_AE_C MSALQIA IVG. .LIVAL ILAIWWTIV FIEYRKILRQ 

CRF01_AE_C RLPLHICA IVG. .LIVAL IIAIWWTIV AIEYKKLRRQ 

CRF01_AE_C MLPLQISA IVR. .LIVAL ILAIWWTLV AIEYRILR Q 

CRF01_AE_T MSPLEISA IVG. .LIVAI ILVIWWTIV AIEFKKILGQ 

CRF01_AE_T MSPLEISA IVG.. LIVAL ILAIWWTIV AIEFKKILRQ 

CRF01_AE_T MHLWKLA IVG. . LLVAL ILAIWWTIV AIEFKKILRQ 

CRF01_AE_T MTPLEISA WG. .LIWL ILAIWWTIV GILKKILR Q 

CRF01_AE_T MTPLEISA IVG. .LIVAL ILAIWT.IV AIEVKKILRQ 

CRF01_AE_T MTPLQISA IVG. .LIVAL ILAIWT.IV I IEYKKILKQ 

CRF02_AG_F MQSLEITA IVG. .LWAF IAAIWWAIV FIEYRKIRRQ 

CRF02_AG_F MKSLEIAA IVG .. LWAI IAGIWWTIV VIEYRKIRKQ 

CRF02_AG_G MQSLEIAA IVG. .LWAF IAAIWWIIV YIEYRKIRKQ 

CRF02_AG_N MQPLTITA IVG. .LWAF IAAIWWSIV YIEYRKIRKQ 

CRF02_AG_S MNSLDIVA IVG. .LWAF IAAIWWTIV YIEYRKIRKQ 

CRF02_AG_S MQPLAVAA IVG. .LWAF ILAIWWTIV FIEYRKIRRQ 

CRF03_AB_R MQSLAIAA IVA. .LVWG IIAIWGSIV FIEYRKILRQ 

CRF03_AB_R MQFLVIAA IVA. . LWGG IIAIWWSIV FIEYRKILRQ 

CRF04_cpx_ MLFWEIWA IVG.. LWAL IIVIWWTLV FIEYKKLRRQ 

CRF04_cpx_ M SSLSSLEIWT IVA. .LIVAL IIAIWWTIV YIEYKKLQRQ 

CRF04_cpx_ M SSMSSLQVWA ITA. .LIVAL IIAIIVWTIA LIEYRKLLRQ 

CRF05_DF_B MSDLLT IAWALIVAL IIAIWWIIV YIEYRKLVRQ 
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CRF05_DF_B MSDLLA VAIAAFIVAL IIAIWWTIV YLEYRKLVRQ 

CRF06_cpx_ MQALEIAA IVG..LWAF LAAIWWTIV FIQYREIRKQ 

CRF06_cpx_ MQSLEISA IVG. .LWAF LAAIWWTIV FIQYREIRKQ 

CRF06_cpx_ MQPLEISA WG. . LIVAF IiAAIWWTIV YIQYREIRKQ 

CRF06_cpx_ MQPLEISA IVG. .LIVAF LAAIWWTIV FILHRQIKKQ 

CRFll_cpx_ MIPLQIAG IVA . . LIVAA IIVIIVWTIV FIEYRKLLKQ 

CRFIl_cpx_ MITLRTVK HSS . . ISQHL LIAIIVWTIA GIEYRRLLRH 

D_CD_84ZR0 MQSLQILA IVA. . LVLAL IIAIWWTIV FIEYRRIKRQ 

D_CD_ELI_K MQPLGIIA I AA . . LWAI ILAIWWTIV FIEYRRIKKQ 

D_CD_NDK_M MQPLVIIA IAA. .LWAI IIAIWWTIV YIEYRRIKRQ 

D_UG_94UG1 MQPLEILA IVA. .LWAL ILAIWWTIV FIECKKLRRQ 

F1_BE_VI85 MSYLLA IGIAALIVAL IIAIWWTIV YIEYKKLVRQ 

F1 - BR - 93BR MSNLLA IGIAALIVAL IITIWWTIA YIEYKKLVRQ 

F1 - FI - F1N9 MSDLLA ITIVAFIVAL IIVIWWTIV FIEYKKLVRQ 

F1_FR_MP41 MSNLYV LSIVAFIIAL IIAIWWTIV F IE YKKLLRQ 

F2_CM_MP25 MSLSLI WIAAYIWL ILAIIVWTIV YIEYKKILRQ 

F2KU_BE_VI MNL.LL VGIGALIVAF LLAIWWTIA YLEYRKVLKQ 

G_BE_DRCBL MQPLEISA IVG. .LIVAS IAAIWWTIV FIEYRKIRKQ 

G_NG_92NG0 MQALEIS LIVAF IAATIVWSIV FIEYRKIRKQ 

G_SE_SE616 MQSLVILA IVG.. LIVAF IAAIWWTIV FIEYREIRKQ 

H_BE_VI991 MNILGI GIG . ALWAF IIAIWWTIA YIEYRKLK.Q 

H_BE_VI997 MYIIGI GIG.ALIVAF IIAIWWTIV Y IE YRKLVKQ 

H_CF_90CF0 MYILGL GIG.ALWTF IIAVIVWTIV YIEYKKLVRQ 

J_SE_SE702 MVSLQIVA IVA. .LIVAF FLAICVWTIV YIEYKKLLRQ 

J_SE_SE788 MIPLQIAA IVA. . FIVAI FLAIGMWTIV YIEYKKLLRQ 

K_CD_EQTB1 . . . . MVPLT VGIIALVAAL ILAIIVWTIA YLEYRKWRQ 

K_CM_MP535 MVSLA ISIVALWAL ILAIIVWTiV YIEYRKLVKQ 

N_CM_YBF30 MLS LGF I ALGAAV SIAVIVWALL YREYKKIKLQ 

O_CM_ANT70 MHHRDLLA III..ISALL FINVILWGFI LRKYLEQKEQ 

0_CM_MVP51 MHQENLLA LIA. . LSALC LINVLIWLFN LRIYLVQRKQ 

0_SN_99SE_ MHHRDLLA LIT . . TSALL LTNWLWTF I LRQYLKQKKQ 

0_SN_99SE_ MHHRDLLT LII . . ISALL LTNVI LWAF I LRQYLRQKKQ 

U - CD 83C MSILQIVA IVA. .IIVAL ILAIWWTIV YIEYKRLLRQ 



00BW0762_1 KKIDR L IKRIIERAED SGNESDGD.H EEL STM VDMGHLRLLD 

00BW0768_2 RKIDW L IKRIRERAED SGNESEGD.T EEL STM VDMGQLRLLD 

00BW0874_2 RKID L IKRIRERAED SGNESDGD.T EEL .... STM VDMGNLRLLD 

00BW1471_2 KKIDQ L VKRIREREED SGNESDGD . I EEL STM VDMGHLRLLD 

00BW1616_2 KRIDN L IKRIRERAED SGNESDGD.T EEL STL VDMGNLRLLD 

00BW1686_8 RKIDR L IERIKEREED SGNESEGD.T EEL ATM VDMGQLRLLD 

00BW1759_3 RKINW L IERIRERAED SGNESEGD.T EEL ATM VDMGHLRLLH 

00BW1773_2 KKIDR. . . .V LEKIRERAED SGNESEGD.L EEL. . . .STL VDVEHLRLVD 

00BW1783_5 RKIDR L IERIRERAED SGNESDGD. Q EEL STM VDMGHLRLLD 

00BW1795_6 RKINW L IERIRERAED SGNESDGD. Q EEL ATM VDMGHLRLLD 

O0BW1811_3 AKIDK I IKRIRERAED SGNESEGD . I EEL STM VDMGHLRLMD 

00BW1859_5 RKIDW L IKRIRERAED SGNESEGD.T EEL ATM VDMGHLRLLD 

00BW1880_2 KKIDW L IERIRERAED SGNESDGD.H EEL STM VDMGHLRLLD 

O0BW1921_l RKIDR L IKRIRERAED SGNESEGD.T EEL ATM VDMGQLRLLG 

00BW2036_1 RKIDW L IKRIRERAED SGNESEGD. I EEL ATM VDMGHLRLLD 

O0BW2063_6 RKIDW L IKRIRERAED SGNESDGD.H EEL ATM VDLGHLRLLD 

00BW2087_2 RKLDW L VKRIRERAED SGNESDGD. V EEL STM VDMGHLRLLD 

00BW2127_2 RKIDW L IKRIGERAED SGNESEGD.T EEL ATM VDMGHLRLLD 

O0BW2276_7 RKIDW L VKRIRERAED SGNESEGD.T EEL ATM VDMGHI RLLD 

00BW3819_3 RKIDW L IKRIREREED SGNESEGD.T EEL AAM VDMGHLELLG 

00BW3842_8 KKIDW L IERIRERAED NGNESEGD.T EEL ATM VDMGHLRLLD 

00BW3871_3 RKIDW L IKRIRERAED SGNESEGD.T EEL ATM VDMGQLRLLD 

00BW3876_9 RKIDR. . . .L IKRIRERAED SGNESEGD. I EEL. . . . EAI VDMGHLRLLG 

00BW388S_8 RKIDW L IKRIRERAED SGNESEGD. I EDL AMV VDMGQLRLLD 

00BW3891_6 KRIDR L IERIRERAED SGNESEGD. I EEL STM VDMGHLRLLD 

00BW3970_2 RRLDW L VKRIRERAED SGNESEGD. N EEL STV VDLGHLRLLD 
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GJBE_DRCBL 
GJNG_92NG0 
G_JSE_SE616 
HJBE_VI991 
H_BE_VI997 
H_CF_90CF0 
J_SE__SE702 
J_SE_SE788 
K_CD_EQTB1 
K_CM_MP535 
N_CM_YBF3 0 
O_CM_ANT70 
O CM MVP51 



DN. . . . 
ANDL. . 
ADGL. . 

L 

VNDL. . 
VNDL. . 
VNDL . . 
VNDM. . 
VNDL. . 
VNDL. . 
GDNL. . 
GDNL. . 
GDNV. . 
GDNL. . 
GDNL. . 
GDNV. . 
GDNL. . 
GDNL. . 
GDNL. . 
DNDNL. 
DNDNL. 
DNL. . . 
DNDNL. 
DNDNL. 
DIDNL. 
ADDL . . 
ADDL . \ 
GDNL . . 
GNNV. . 
GNNL. . 
INNL. . 
INNL. . 
GDNL. . 
GDNL. . 
GDNL. . 
GDNL. . 
VNDL. . 
VNDL. . 
VDDDL. 
IDDL . . 
VDDL. . 
ADDM. . 
INNL. . 
INNL. . 
VNNL. . 
INNL. . 
INNL . . 
IHNL . . 
GDNL. . 
GNNL. . 
GDNL. . 
VADL. . 
VADL. . 
VADL. . 
VNDL. . 
VNDL . . 
INNL. . 
IDNL. . 
QGNWV. 
NPMFEP 
NPMFEL 
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Table 20. BLASTP Sequences producing significant alignments with S20757 (HBV 
Polymerase subtype ayw) 

Score E : 

(bits) Value 

93080 | pir | |S20757 DNA-directed DNA polymerase (EC 2.7.7.. 
8925755 |gb|AAF81607.l| DNA polymerase/ reverse transcript. 
1514497 | emb |CAA68864.l| P [Hepatitis B virus] 
27466573 |gb j AA012632.1 j polymerase [Hepatitis B virus] 
5257489|gb|AAD41360.l| polymerase [Hepatitis B virus] 
118876 |sp|P03 156 |DPOL_HPBVY P protein [Includes: DNA-dir. 
27466565|gb|AA012625.l| polymerase [Hepatitis B virus] 
67003 |pir| | JDVLVB DNA-dir ec ted DNA polymerase (EC 2.7.7.. 
59433 j emb |CAA46352.l| polymerase ORF [Hepatitis B virus] 
6692498 |gb | AAF24666.1 polymerase [Hepatitis B virus] 
6692505 |gb|AAF24673.1 polymerase [Hepatitis B virus] 
2117935 |pir| |S71785 DNA-directed DNA polymerase (EC 2.7.. 
28436101 | dbj |BAC57445.l| polymerase [Hepatitis B virus] 
631984 |pir | |S47406 DNA-directed DNA polymerase (EC 2.7.7. 
1359687 | emb |CAA66431.l| polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
, polymerase [Hepatitis B virus] >. 
118869 |sp|P24024|DPOL_HPBVA P protein [Includes: DNA-dir. 
27466525|gb|AA012672.l| polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
DNA polymerase/reverse transcrip. 
DNA polymerase/reverse transcrip., 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 



18621117 | emb | CAC87021. l| 
2843609ljdbj |BAC57437.l| 
6692512 |gb|AAF24680.l| 
22135695 | gb | AAM09037 . 1 | 
18621125 j emb | CAC87015 . 1 1 
1359679 | emb | CAA66424 . 1 | 
6692492 j gb | AAF24660*. 1 
2182121 j gb | AAB59972 . 1 
4 1402 95 j emb | CAA1053 9 . 1 1 
28436096|dbj |BAC57441.l| 
2829156|gb|AAC40810.l| 
27466519 |gb|AAO12604 .l| 



762933 |emb|CAA59514.l| 
22135690 | gb | AAM09033 . l| 
6063470 |dbj |BAA85377.l| 
6063465 jdbj |BAA85373.l| 
27466605 |gb|AAO12660 .1 | 
2829149 | gb | AAC40804 . 1 | 
475987 | gb | AAA18583 . 1 1 



313784 jemb|CAA42466.l| polymerase [Hepatitis B virus] 
27466597 | gb | AA012653 .1 polymerase [Hepatitis B virus] 
15419833 jgb j AAK97182.1 AF297620_3 polymerase [Hepatitis 
93 082 | pir | |S2 0752 DNA-directed DNA polymerase (EC 2.7.7. 



27466613 | gb 
27466589 jgb 
27466538 jgb 
27466581 jgb 
15419828 jgb 
27466544 jgb 
27466557 jgb 
16751312 jgb 
11935073 jgb 
13491150 gb 



polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
AF297619_3 polymerase [Hepatitis . 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase protein [Hepatitis B . 
AF305327_2 polymerase [Hepatitis . 
AF330110_ 3 polymerase [Hepatitis . 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase protein [Hepatitis B. 
„ BAA89330.1| polymerase protein [Hepatitis B . 
118872 |sp|P12900|DPOL_HPBVL P protein [Includes: DNA-dir. 
560084|dbj |BAA04927.1| DNA polymerase [Hepatitis B virus] 



6116700 
3551332 



dbj 
dbj 



AA012667 .1 
AA012646.1 
AA012618.1 
AA012639.1 
AAK97178.1 
AA012681 
AA012692 
AAL25951 
AAG41955 
AAK27856 
BAA32859 
BAA32886 



28812222 | dbj |BAC65108.l| 
6691505 | dbj 
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SS2i^TS£"": l I, polymerase tHepatitis B virus] 



6116731 
6691495 
7188655 
6063460 
3551347 
6691500 



dbj BAA32957.2) pol protein [Hepatitis B virus] 
dbj|BAA89322.l| polymerase protein [Hepatitis B 
gb|AAF37833.l|AF222323_2 polymerase [Hepatitis b! 
dbD BAA85369.ll DNA polymerase/reverse transcrip. 
dbj BAA32898.1| pol protein [Hepatitis B virus] 
dbD |BAA893 26.1| polymerase protein [Hepatitis B 

JlSSill??!?^ 65104 ^ 1 P 01 *" 0 ™" Protein [Hepatitis b! 
3551342 |dbjBAA32894.l| pol protein [Hepatitis B virus] 
628080 |pir | S43491 DNA-directed DNA polymerase (EC 2 7 7 
12246972 |gb AAG49670.il AF223956_3 polymerase [Hepatitis ' 
3551293 | dbj BAA32852.1|- pol protein [hepatitis B virus] * 
12246964 gb | AAQ49663.il AF223955_3 polymerase [Hepatitis 
SSSl^TSi^?" 03 - 11 P ol y—ase protein [HepatSs b! 
llltltl^l* ^!li? POL -^ PBVR P P rot ein [includes: DNA-dir. 

polymerase [Hepatitis B virus] 
AF286594_3 DNA polymerase [Hepati . 
AF461043_2 P protein [Hepatitis B. 
AF411409_4 polymerase [Hepatitis . 

pol protein [Hepatitis B virus] 
AF384371_2 polymerase [Hepatitis . 
DNA polymerase/reverse transcrip. 
DNA polymerase/reverse transcrip. 
pol protein [Hepatitis B virus] 
AF458664_3 polymerase [Hepatitis . 

polymerase [Hepatitis B virus] 
AF363961_2 polymerase [Hepatitis . 
DNA polymerase/reverse transcrip. 
DNA polymerase [Hepatitis B viru. 
P protein [Hepatitis B virus] 
DNA polymerase/reverse transcrip. 
polymerase [Hepatitis B virus] 
AAL66323.1|AF461359_3 polymerase [Hepatitis . 
„ AA „ ,^ .1 BAA234 55.1| DNA polymerase [Hepatitis B virus 
Inllln \ 3 BAA04919 ' 1 I DNA polymerase [Hepatitis B virus] 
479847|p ir | |S35527 DNA-directed DNA polymerase (EC 2 7 7 



6009784|dbj 
22651880 |gb 
18252591 jgb 
15778326|gb 
3551268 |dbj 
1429024l|gb 



6063435 
6063440 
3551283 
18252536 | gb 
600?769|dbj 
j 13991865 | gb 



dbj 
dbj 
dbj 



|6063425 
(2626986 
| 4490402 



dbj 
dbj 
emb 



j 22415735 | gb t 
1 10934057 j dbj |BAB16885.1 
18252556|gb 
2627009|dbj 



BAA85065.1 
AAN03491.1 
AAIi66348.1 
AAL07381 
BAA32832 
AAK59316 
BAA85353 
BAA85357 
BAA32844 
AAL66307 
BAA85053 
AAK51533 
BAA853 82 
BAA23435 
CAB38767 
AAM95242 



] 



18252545 | gb|AAL663 14.1 



dbj 
dbj 
dbj 
dbj 



1742906 
6009764 
2627002 
6063455 
10441115 |gb 
6009774 j dbj 
4490407 | emb 
3582359 jdbj 
3582355 (dbj 
12246980 | gb 
16751307 jgb 
3582375 | db j 
15778340|gb 



BAA09083 . 1 
BAA85049.1 
BAA23449.1 
BAA85365.1 
AAG16953 .1 
BAA85057.1 
CAB38771.1 
BAA32913 . 1 
BAA32874. 1 
AAG49677.1 
AAL25947.1 
BAA32 925.1 
AAL07392 .1 



4206637 |gb|AAD11755.1 



AF461357J2 polymerase [Hepatitis . 
DNA polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
DNA polymerase/reverse transcrip.. 
AF182804_4 polymerase [Hepatitis . . 
polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
AF223957J3 polymerase [Hepatitis . . , 
polymerase protein [Hepatitis B . . , 
pol protein [Hepatitis B virus] 
AF411412_4 polymerase [Hepatitis . . . 
DNA polymerase [Hepatitis B virus] 



15425690|dbj|BAB64319.l| polymerase [Hepatitis B virus] 



3551352 
3582395 
5114071 
9082085 
6983935 



dbj |BAA32 902 . 1 
dbj |BAA32963 .1 



gb|AAD40205.1 
gb|AAF82723 .1 
gb|AAF34 734.1 
560094 | dbj |BAA04935.1 t 
18032033 | gb | AAL49990.1 



pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 



AF090839_2 polymerase [Hepatitis B. 
AF233236_2 pol [Hepatitis B virus] 
AF160501_2 polymerase [Hepatitis B. 
DNA polymerase [Hepatitis B virus] 
; - • ---j polymerase [Hepatitis B virus 1 

i!i«SSi?? ,BAB8a392 - l l Polymerase [Hepatitis B virus] 
6006322 1 db:,|BAA84819.l| polymerase protein [Hepatitis B . 
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1825255l|gb|AAL66319.l|AF461358_3 polymerase [Hepatitis ... 
7188649|gb|AAF37828.l|AF222322_2 polymerase [Hepatitis B. . 
12060441 jdbj | BAB20611.1 | DNA polymerase (Hepatitis B virus] 



18845085 jgb 
3551322 |dbj 
12246956 |gb 



6063430 
2288872 
1220111 
9454168 



dbj 
dbj 
dbj 



AAL79545 .1 
BAA32878 .1 
AAG49656.1 
BAA85349.1 
BAA21665.1 
BAA04072 .1 



gb|AAF87689. lj 



AF473 543_4 P protein [Hepatitis B. 

pol protein [Hepatitis B virus] 
AF223954_4 polymerase [Hepatitis . . . 
DNA polymerase/reverse transcrip... 
DNA polymerase [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
T -. polymerase protein [Hepatitis B v. . . 

18146683 |dbj|BAB82402.1 | polymerase [Hepatitis B virus] 
3551278 | dbj |BAA32840.l| pol protein [Hepatitis B virus] 
3551372 |dbj BAA32939.1| pol protein [Hepatitis B virus] 
19849035 | gb | AAL99437 . 1 | AF405706_3 polymerase [Hepatitis 
3551357|dbj|BAA32906.l| pol protein [Hepatitis B virus] 
15778321 gb AAL07377 . 1 | AF411408_4 polymerase [Hepatitis 
15072542 gb | AAK81690.1 | polymerase protein [Hepatitis B 
21624238 dbj | BAC01109.1 | polymerase protein [Hepatitis B 
12247012 | gb | AAG49705 . 1 | AF223961_3 polymerase [Hepatitis 
5114086 gb|AAD40217.l|AF090842_2 polymerase [Hepatitis B 
3582407|dbj|BAA32972.l| pol protein [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
AF241410_3 polymerase [Hepatitis . . 
AF3639~62_2 polymerase [Hepatitis . . 
DNA polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
dbj | BAB82397.1) polymerase [Hepatitis B virus] 
gb|AAG49684.l|AF223958_3 polymerase [Hepatitis . . 
emb|CAC51286.l| polymerase [Hepatitis B virus] 



15425698 | db j (BAB64325.1 
18146665|dbj |BAB82387.1 
23194252 jgb|AAN15074 .1 1 
560079 | dbj |BAA04923.l| 
10443833 |gb|AAG17595 .1 
13 991870 jgb j AAK51537 .1 
4007054 |emb|CAA10426.1 
3551362 j dbj |BAA32910.1 
18146677 " 



gb 



12246988 
15211897 
18389989,^ 
3582363 |dbj 
10441110 |gb 
10443841 jgb 
3551382 |dbj 
3582387 j db j 
3551317 jdbj 
10441104 |gb 



AAL68823 .1 
BAA32916.1 
AAG16949.1 
AAG17602 .1 
BAA32947 .1 
BAA32950 . 1 
BAA32871.1 
AAG16944 .1 



polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
AF182803_4 polymerase [Hepatitis . 
AF241411_3 polymerase [Hepatitis . 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
AF182802_3 polymerase [Hepatitis 



118866 |sp|P03159|DPOL_HPBV2 P protein [Includes: DNA-dir! 



15425694 |dbj |BAB64322.1 
4007049 | emb | CAA10422 .1 
2912323 9 | gb |AA062971.1 
4007064 | emb |CAA1043 8 .1 
452623 |emb|CAA53358.l| 



polymerase [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
■ ■ --, polymerase [Hepatitis B virus] 

1825254l|gb|AAL663U.l|AF458665_3 polymerase [Hepatitis . 
527443 | emb |CAA84791.l| DNA polymerase [Hepatitis B virus] 
15211890 | emb |CAC51280.l| polymerase [Hepatitis B virus] 
329617|gb|AAA62812.l| DNA polymerase 

4007079) emb |CAA10454.l| DNA polymerase [Hepatitis B virus] 
9454173 | gb AAF87693.l| polymerase protein [Hepatitis B v. 
452628 |emb|CAA53354.1 | polymerase [Hepatitis B virus] 
3582367 dbj | BAA32919 . 1 | pol protein [Hepatitis B virus] 
5114066 |gb|AAD40201.l|AF090838_2 polymerase [Hepatitis B. 



15419860 |gb 
4490412 | emb 
18252566 |gb 
4007059 |emb 



AAK97203 .1 
CAB38775.1 
AAL66331.1 
CAA10430 .1 



AF297625_3 polymerase [Hepatitis 

P protein [Hepatitis B virus] 
AF461361_3 polymerase [Hepatitis . . 
. DNA polymerase [Hepatitis B virus] 

5114081 gb|AAD40213.l|AF090841_2 polymerase [Hepatitis B. . 
358237l|db 3 |BAA32922.l| pol protein [Hepatitis B virus] 
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12247003 |gb 
4033548 |emb 
389258l|emb 



AAG49697.1 AF223960_4 polymerase [Hepatitis ... 
CAA10450.1| DNA polymerase [Hepatitis B virus] 
^ ny ,„ ; x.^^ 09962 -}! Polymerase [Hepatitis B virus] 
5114076|gb|AAD40209.l|AF090840 2 polymerase [Hepatitis B 
12060436ldbjlBAB20607.ll DNA polymerase [Hepatitis B virus] 
118868|sp|P17100|DPOL_HPBV9 P protein [Includes : DNA-dir^ 



274 66434 | gb | AA012555 .1 
3582399|dbj |BAA32966.1 
3551273 jdbj (BAA32836.1 
14285168 | gb j AAK58873 .1 
3582391 | dbj |BAA32953.1 
15419845 |gb |AAK97191.1 



polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase [synthetic construct] . . 
pol protein [Hepatitis B virus] 

, 7 AP297622 3 polymerase [Hepatitis 

118870|sp|P17393|DPOLHPBVI P protein [Includes S^-dir" ' 
3551377 | dbj BAA32943.l| pol protein [Hepatitis B virus ]^ " 
10443825 gb AAG17588.1 | AF241409 3 polymerase [Hepatitis 
10443817 gb AAG17581.1 1 AF241408~3 ?ol£nerase BteSItitil 
?oo^r l ? b| ^ 06 ? 519 ' 11 P ol P^tein SpatitifB^s] " 
^«^I?^ P ? 1870,DPOL ~ HPBW P P»tein [includes: DNA-dir. 
6063445|db D jBAA85339.l| DNA polymerase/reverse transcrip! ! 

polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
9 t ^ G16957 -1|AF182805_4 polymerase [Hepatitis .. 

^!^I 574 * 1| ^ F2414 ° 7 - 3 P ol ^rase [Hepatitis 
• nfJftfol cabD|BAB82407.l| polymerase [Hepatitis B virus] 

h^ 10442 - 1 ' D * A P ol ^erase [Hepatitis B virus] 
,«oco«f 9 w AAK57744 - 1 Polymerase [Hepatitis B virus] 
18252561 | gb | AAL66327 . 1 AF461360 3 polymerase [Hepatitis 
6959503|gb|AAF33121.l| polymerase pro^in [orangSJan nep! ! 



19568078 
27466426 
22655601 



gb|AAL89566.1 
gb|AA012548.1 
gb|AAN04128 .1 



8161369 |gb|AAA69721.2 
10441120 
10443809 
18146689 



26224721 |gb 
4007074 |emb 
18031714 |gb 
743479l|pir 
15419855 | gb 
7434793 |pir , t 
4323205 |gb|AAD16257.l| 



12060194 
23194347 
20151228 
23884547 
21431681 



gb 

gb 

gb 
gb 



AAN76318.l| polymerase [Hepatitis B virus] 
CAA10446.1| DNA polymerase [Hepatitis B virus] 
AAK57745.1| polymerase [Hepatitis B virus] 
|S67505 DNA-directed DNA polymerase (EC 2 7 
AAK97199.1|AF297624_3 polymerase [Hepatitis'!! 
|T13468 DNA-directed DNA polymerase (EC 2.7... 

polymerase [Hepatitis B virus] 



dbj |BAB20451.l] 



AAN15122 .1 
AAM12945 .1 
AAN40009.1 
AAM53414.1 
BAA32890.1 



3551337 | dbj t 

5019933 jgb | AAD37919.1 
15419840 |gb 
600633l|dbj 
19568073 |gb 
29124918 jgb 



DNA polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
DNA polymerase/reverse transcrip . . . 
pol protein [Hepatitis B virus] 
U87747J3 DNA polymerase/reverse t. . . 
pol protein [Hepatitis B virus] 
P protein [Hepatitis B virus] 
AAK97187.1|AF297621_3 polymerase [Hepatitis . 
BAA84825.1 polymerase protein [Hepatitis B ... 
AAL89569.1| polymerase [Hepatitis B virus] 
,o 0 ^^i *L7, AA063539 ; 1 I Pol protein [Hepatitis B virus] 
329630|gb|AAA45483.l| P protein [Hepatitis B virus] 
1577833l|gb AAL07385.1 AF411410 4 polymerase [Hepatitis 
2 B ^8275.l P protein Otitis B virus ^ * ' ' 

? AB387 "- 1 P Pr ° tein hepatitis B virus 
SSISTiWISK 0445 ^ 1 DNA P ol ^erase [Hepatitis B virus] 
f?™? 1 ^ 31 ^ 8 ? 343 - 1 ! DNA Polymerase/reverse transcrip.. 
118877|sp|P03155|DPOLHPBVZ P protein [Includes: DNA-dir 
29124883|gb|AA063514.l| pol protein [Hepatitis B v^rust 

DNA polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 



emb|CAA10434.1 j 
gb|AAP24693.l| 
gb|AAF24727.l| 
23194340 | gb | AAN15116 . 1 | 
560064 | dbj | BAA04911 . 1 | 



4033543 
6692525 
6692559 
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gi 
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29124898 |gb|AA063526.l| 
29124927 jgb j AA063545. 1 j 
6692566|gb|AAF24734 .l| 
6692553 j gb j AAF24721 . 1 j 
6692518 jgb |AAF24686.1 



pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] >g. . 
polymerase [Hepatitis B virus] >.. 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] . . 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
. ~ , - -, polymerase [Hepatitis B virus] 

11887l|sp|P17394 |DPOL_HPBVJ P protein [Includes: DNA-dir. . 
9454473 | gb | AAF87833 . 1 [ AF282917_3 DNA polymerse [Hepatiti . . 
19224214 |gb|AAIi86445.l|AF479684_3 P gene product [Hepati . . 



1359702 jemb 
29124867 | gb 
29124872 j gb 
27466479|gb 
6692546 
3551312 



CAA66444.l| 
AAO63501 . 1 j 
AAO63505.1 j 
AAO!2576.l| 
gb|AAF24714.l| 
dbj |BAA32867.l| 



27466487 | gb j AA012611 . 1 j 



6692572 
3551297 
3551327 
1359695 
3551367 



gb iAAF24740.ll 



dbj 
dbj 
emb 
dbj 



BAA32855.l| 
BAA32882 .1 j 
CAA66434.1 j 
BAA32932.1 j 



18621110 | emb | CAC87028 . 1 | 
3582403 |dbj |BAA32969.1 
27261550 |gb|AAN85925 .1 
1914703 | emb j CAA66699 . 1 
4323200 jgb |AAD16253 .l| 
6573293 j dbj |BAA8 8291.l| 
6006341 jdbj |BAA84833.l| 
6566440|dbj |BAA88286.l| 
560059|dbj |BAA04907.l| 
14334410 | gb | AAK59391 . 1 | 
5 019954 |gb|AAD37936.1 | 
16117323 | dbj |BAB69785.1 



polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
Pol protein [Hepatitis B virus] 
118873 |sp|P17395|DPOL_HPBVO P protein [Includes: DNA-dir.. 
29124862 jgb | AA063497.1 | pol protein [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
DNA polymerase [Hepatitis B viru. . 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 
polymerase protein [Hepatitis B . . 
P protein [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
7434792 | pir | |T13473 DNA-directed DNA polymerase (EC 2.7. . . 
5019965 j gb | AAD37945.1) P protein [Hepatitis B virus] 

pol protein [Hepatitis B virus] 
P protein [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
DNA polymerase [Hepatitis B virus] 
AAK97195.l|AF297623_3 polymerase [Hepatitis .. 
NP_037757.l| polymerase protein [orangutan h. . 

polymerase [Hepatitis B virus] 
BAB12582.1] polymerase protein [Hepatitis B 
AA012569.lj polymerase [Hepatitis B virus] 
AAG49726.1|AF223964_3 polymerase [Hepatitis 
_ AAG49719.1|AF223963_3 polymerase [Hepatitis .. 
5019945|gb|AAD37929.l| P protein [Hepatitis B virus] 
18146701 | dbj |BAB82417.l| polymerase [Hepatitis B virus] 
12247020|gb|AAG49712.l|AF223962_3 polymerase [Hepatitis 
501998l|gb|AAD37958.l| P protein [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
AF223965_3 polymerase [Hepatitis . . 
polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase protein [Hepatitis B . . 
DNA polymerase [Hepatitis B virus] 



AA063533.1 
BAA88281.1 
AA063523 .1 
AAM09065.1 



29124908|gb 
6566428 | db j 
29124894 | gb 
22135730|gb i 
560069 | dbj | BAA04915 . 1 
15419850 | gb' 
9634217 | ref _ 
16117333 | db D | BAB69793 . 1 j 
9971630 |dbj 



27466450 
12247036 
12247028 



gb 

gb 

gb 



3892582 jemb | CAA53343 .1 
27466442 | gb j AA012562.1 
22135715 jgb j AAM09053 .1 
12247044 |gb|AAG49733 .1 
22135725 jgb j AAM09061 . 1 , 
11191880 j dbj |BAB17962.i 
3551392|dbj (BAA32961.1 
6006336|dbj |BAA84829.1 
262702l|dbj |BAA23467.1 
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gii 



11191960 |db; 
11191888 jdb 1 
11191840 jdb; 
11191920 jdb; 
11191904 jdb: 
11191952 jdb; 



BAB18032.l| 
BAB17969. 1 j 
BAB17927.1| 
BAB17997.1 j 
BAB17983.1| 
BAB18025.lj 



r —- VJLXUDj 

[Hepatitis B virus] 

[Hepatitis B virus] 

[Hepatitis B virus] 

[Hepatitis B virus] 

[Hepatitis B virus] 

[Hepatitis B virus] 



T^ 23461 " 1 ', DNA P^^ase [Hepatitis B virus] 
16117328 db 3 | BAB69789.1 polymerase [Hepatitis B virus] 

2 ,l\lllll 9 * Polymerase [Hepatitis B virus ^ 

14485226|gb AAK62976 . 1 1 AF3 84372_2 polymerase [Hepa^tis 
3551288|dbj BAA32848.ll pol protein [hepatitis t " 

polymerase r " 
polymerase 
polymerase 
polymerase 
polymerase 

- --i polymerase 

1169410 | sp |Q05486 [DPOIj HPBVT P protein "[Includes: DNA-di 
22135705|gb|AAM09045.l| polymerase [Hepatitis B vi^s] 
nonf^ic^i 0 ^ 3350 * 11 Polynierase [Hepatitis B virus] 
18146695 db3|BAB82412.1 | polymerase [Hepatitis B virus] 

2S t^ 09049 ' 1 ! Polymerase [HepStitis B virus] 
c"^? 64 ^!^ 1794 ?' 1 ' Poly~rase [Hepatitis B virus] 
^™i!S ) i^ 48354 - 11 ™ v Polymerase [Hepatitis B virus] 
11191848 Jdbj | BAB17934.1 | polymerase [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 

polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
P protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
pol protein [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
- : . Polymerase [Hepatitis B virus] 

9 u^ 66335,1 ! AF461362 - 3 Polymerase [Hepatitis ... 
gb AA012597.1| polymerase [Hepatitis B virus] 
gb|AAO!2698.l| polymerase [Hepatitis B virus] 
emb|CAC51293.l| polymerase [Hepatitis B virus] 



— | — ~ J | J- / ^ J «* . X 

22135700 |gb|AAM09041.l| 
5019976 | gb | AAD37954 . 1 | 
22135720 |gb|AAM09057.1 | 
gb|AAD37924.l| 
emb|CAA66674.l| 
emb|CAA66679.l| 
gb|AAD37949.1 | 
'dbj |BAB64328 . 1 1 
gb|AA063531.l| 
gb|AAO12704.l| 
gb|AAO12710.l| 



5019939 
1914697 
1914691 
5019970 
15425702 
29124905 
27466464 
27466471 
18252571 
27466511 
27466457 
15211905 



o T^^I^VT^ P ^ tein [includes: DNA-dir. 



1914708 |emb 
27466503 | gb 



CAA66684.1, 
AA012583 .1 j 



polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 



_ - i ' T_ , i ^*r«ciaoc tncpdcicis Jd virus J 

118867|sp|Pl2933|DPOL_HPBV4 P protein [Includes: DNA-dir 



4468850 
1914719 
9630375 
1185115 



emb 
emb 
ref 
emb 



20800461 |gb 



CAB38229.l| polymerase [Hepatitis B virus] 
CAA66694.1| polymerase [Hepatitis B virus] 
NP_J)46799 . 1 | polymerase [woolly monkey hepat . . 
CAA51254.1 DNA polymerase [Hepatitis B virus] 
AAM2 8642.1 | U87746_4 DNA polymerase/r ever se t.. 

TvTD (ZAHCZriA 1 I -rt — i 



4377612 
4377613 
1549226 
9454414 
1550614 



emb | CAA53339.1J 
emb |CAA53338 .1 j 
dbj |BAA04073.1| 
gb|AAF87797.l| 
dbj |BAA04075.1 



polymerase [Hepatitis B virus] 
polymerase [Hepatitis B virus] 
ORF [Hepatitis B virus] 
polymerase [Hepatitis B virus] 

, ORF [Hepatitis B virus] 

mb ? AA32399 - 1 l DNA polymerase [Hepatitis B virus] 
118894|sp|P03160|DPOL_WHVl P protein [Includes: DNA-dire 
9626716 |ref|NP_040994.l| A protein [Ground s^irrS Spa* 

tttlntal 9 w P^ 04547 - 1 AF410859_1 polymerase [Woodchuck . 

^««^? 7 '?5 ^f 543 - 1 '^ 410855 - 1 *Y** 11 ™tant Polymer. 
118895 |sp|P12899|DP0L_WHV59 P protein [Includes: DNA-dir 
15637597 gb AAL04548.1 | AF410860 1 polymerase [SoodchuS 
15637599 gbAAL04549.l|AF410861-l Joljmerase Woodchuck " 
j;2IIT 3| ? b l mo ««-l|^«0858:i defective polymerase [" , 
118898|sp|P17396|DPOL_WHV8I P protein [Includes I DNA-dir 
1563759l|gb|AA L 04545.l|AF410857 - l type I mutant pol^ltl [ \ 
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15637589 I gb|AAL04544.l|AF4108S6_l type IV mutant polymer, 
118897 |sp|P06275|DPOL_WHV8 P protein [Includes: DNA-dire 
3582379 dbj | BAA32928 . 1 | pol protein [Hepatitis B virus] 
9885813 |gb|AAG01539.l|AF291830_2 polymerase [Hepatitis B 
1 118875 |sp|P03158|DPOLJKPBVW DNA polymerase 



9628830 |ref 
8926931 1 dbj 
8926928 | db j 
8926925 j db j 
8926934 | db j 
|gb 
|gb 
|gb 
|gb 

|gb 
|gb 
|gb 

|gb 
|gb 



13345982 
12083172 
13345979 
12083181 
12083178 
12083163 
12083167 
13345988 
13345985 



NP_043864.l| 
BAA98025.1 
BAA98023.1 
BAA98021.1 
BAA98027.1 
AAK19538.1 
AAG48743.1 
AAK19536.1 
AAG48749.1 
AAG48747.1 
AAG48737.1 
AAG48740.1 
AAK19542.1 
AAK19540.1 



2982339 | gb | AAC06361 . 1 | 



polymerase 
pol protein 
pol protein 
pol protein 
pol protein 
AF335734_2 
AF329861_2 
AF335733_2 
AF329864_2 
AF329863_2 
AF329858_1 
AF329859_2 
AF335736J2 
AF335735 2 



[Arctic ground squir. 
[Hepatitis B virus] 
[Hepatitis B virus] 
[Hepatitis B virus] 
[Hepatitis B virus] 

[Hepatitis . 
[Hepatitis . 
[Hepatitis . 
[Hepatitis . 
[Hepatitis . 
[Hepatitis . 
[Hepatitis . 
[Hepatitis 
[Hepatitis 



polymerase 
polymerase 
polymerase 
polymerase 
polymerase 
polymerase 
polymerase 
polymerase 



336159 
118899 
225532 



gb|AAA46774 . 1 I 



_ polymerase 
DNA polymerase [Hepatitis B virus] 
polymerase protein 
sp|P11292|DPOL_WHVW6 P protein [Includes: DNA-dir. 
prf | | 1305266C gene P 
1107586 jemb | CAA56892.1 polymerase [Hepatitis B virus] 
1107579 emb j CAA56878.1 polymerase [Hepatitis B virus] 
1185116 Jemb | CAA51255.1 HBsAg [Hepatitis B virus] 
59414 |emb|CAA32405.l| DNA polymerase [Hepatitis B virus] 

^o?!no?' r !f |NP - 6476 ° 8 ; 11 P gene P r °duc t/ put . DNA polyme. 
1321828 emb|CAA96556.l| polymerase [Hepatitis B virus] 
5019960|gb|AAD37941.l| P protein [Hepatitis B virus] 
329652 gb|AAA69719.l| coat protein [Hepatitis B virus] 
329651 |gb|AAA69720.l| coat protein [Hepatitis B virus] 



27466495 | gb 



21218028|dbj|BAB96528.l| large S protein [Hepatitis B vi . 



1321832 |emb CAA96561.1 
27450190 gb AA014552.1 
27450188 gb AA014551.1 
27450192 gb AA014553.1 
27450198 gb AA014556.1 
27450196 gb AA014555.1 
27450194 gb AA014554.1 
27450200 gb AA014557.1 
27450202 gb AA014558.1 
332 8370 |gb|AAC26832.l| 



AAO12590.1) polymerase [Hepatitis B virus] 



[Hepatitis . 

[Hepatitis . 

[Hepatitis . 

[Hepatitis . 

[Hepatitis . 

[Hepatitis . 
[Hepatitis 



23380174 
23380081 
23380171 
23380180 
23380177 
23380072 
23380084 
23380078 
23380066 
23380111 
23380063 
23380087 



gb 

gb 

gb 

gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 
gb 



AAM83022.1 
AAM82960.1 
AAM83020 
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Query= gi | 93080 |pir| |S20757 DNA- directed DNA polymerase (EC 
2.7.7.7) - hepatitis B virus (subtype ayw, patient E) 

(832 letters) 
cutoff = 3e-88 

Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,376,942 sequences; 442,405,847 total letters 

BLASTP 2.2.5 (Nov-16-2002) (Altschul, S.F., et aL, "Gapped BLAST and PSI-BLAST: a new 
generation of protein database search programs", Nucleic Acids Res. 25:3389-3402 (1997)) 
against HBV subtype sequence S20757, cutoff = 3e-88 (to select human sequences). 
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Table 21: CLUSTALW alignment of 19 HBV polymerase sequences 
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JDVLJ3 
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JDVLA1 
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JQ2229 
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UNK_494018 

JDVLVS 

JDVLVR 

T13468 

T13473 

S43491 

S35527 

Prim. cons. 
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JDVLVR 
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T13473 

S43491 
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MPLSYQHFRKLLLIJ3D--EAGPLEEELPRLADEDLNRR^ 

MPLS YQHFRRLLLLDD - -EAGPLEEELPRLADEDLNRRVAEDLNLGNLNVS I PWTHKVGN 
MPLSYQHFRRLLLLDD - - EAGPLEEELPRLADEGLNRHVAEELNLGNLNVS I PWTHKVGN 
MPLS YQHFRRLLLLDD - -EAGPLEEELPRLADEGLNRRVAEDLNLGNLNVS I PWTHKVGN 
MPLSCPHFRKLLLLDE - -EAGPLEBELPRLADEGLNRRVAEDLNLQLPNVS I PWTHKVGN 
MPLSYPHFRKLLLLDD- -EAGPLEEELPRLADEDLNRRVAADLNLQLPNVS I PWTHKVGN 
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FTGLYSSTVPVFNPHWKTPSFPNIHLRQDIIKKCEQFVGPLTVNEKRRLQLIMPARFYPK 
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FTGFYSSTVPVFNPHWETPS FPNIHLHQD I IKKCEQFVGPLTVNEKRRLQLIMPARFYPK 
FTGLYS STVPVFNPKWQTPS FPD IHLHQDI INKCEQFVGPLTVNEKRRLKL 1 1 AARF YPN 

FTGLYSSTVPAFNPNWSTPSFPDIHLHQDLISKCEQFVGPLTKNELRRLKLVMPARFYPK 

FTGLYS STVPVFNPDWKTPS FPHIHLQEDI INRCQQYVGPLTVNEKRRL^IMPARF 
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VTKYLPI^KGIKPYYPEHLVNHYFQTRHYI^TLWKAGILYKRETTHSASFCGSPYSWEQE 
VTKYLPLDKGIKPYYPEHLVWHYFQTRHYIJJTLWKAGVLYKRETTHSASFCGSPYSWEQE 
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UNK_494016 

UNK_494017 ~z_zzzzzzz~zzzzzzzz 

UNK_4 94 0 15 LTK^PLDKGIKPYYPEYAVNHYFKTRHYIjHTLWKAGIIiyKRETTRSASFCGSPYSWEQE 

UNK_4 94 0 1 8 LTKYIjPI^KGIKPYYPEYAVNHYFKTRHYIjHTLWKAGILYKRETTRSASFCGSPYSWEQE 

JDVLVS LTKYLPLDKGIKPYYPEHAVNHYFKTRHYLHTLWKAGILYKRETTRSASFCGSPYSWEQE 

JDVLVR LTKYLPLDKGIKPYYPEHAVNHYFKTRHYIjHTLWKAGILYKRETTRSASFCGSPYSWEQE 

T13 4 6 8 LTKYIiPLDKGIKPYYPEHAVNHYFKTRHYLHTLWKAGILYKRETTRSASFCGSPYSWEQE 

T13473 LTKYLPIiDKGIK^YYPEHAVNHYFKTRHYLHTLWKAGIIiYKRETTRSASFCGSPYSWEQE 

S434 91 LTKYLPLDKGIKPYYPEHAVHEIYFQTRHYLHTLWKAGILYKRETTRSASFCGSPYSWEQE 

S35527 ^TKYLPLDKGI KP YYPEHAVNHYFKTRHYLHTLWKAG IL YKRETTRS AS FCGS PYS WEQE 

Prim . cons . LTKYLPLDKG I KP YYPEHAVNHYFQTRHYLHTLWKAGIL YKRETTRS AS FCGS PYS WEQE 

190 200 210 220 230 240 

I I I | | I 

JDVLKS LQHGRliVIKTSQRHGDESFCSQPSGILSRSSVGPCIRSQLKQSRLGLQPHQGPIiASSQPG 

JDVLVD LQHGRliVIKTSQRHGDESFCSQSSGILSRSSVGPCIRSQLKQSRLGLQPRQGRLASSQPS 

JDVLJ3 LQHGRLVLQTSTRHGDKSFRPQSSGILSRSPVGPCIQSQLRQSRLGPQPTQGQLAGLQQG 

S71785 LQHG AESFHQQSSGILSRPPVGSSLQSKHRKSRLGLQSQQGHLARRQQG 

S474 06 LQHG AESFHQQSSGILSRPPVGSSLQSKHSKSRLGLQSQQGHLARRQQG 

JDVLA1 LQHG AESFHQQSSGILSRPPVGSSLQSKHCKSRLGLQSQQGLLARRQQG 

S20752 LQHG AESIHQQSSGILSRPPVGSSLQSKHRKSRLGLQSQQGHLARRQQG 

S67505 LQHG AEPVCQQSLGILPRASVGSPVQSQLKQSRLGLQSQQRQLARSHQG 

UNK 2 494016 I'QHGSTSLNDTKRHGTESLCAQSSGILSRPSAGSAIQSKFQQSRLGLQHKQGQLANGKQG 

xnjK~494pi7 — - — zzzzzzzzzzzzzzzzzz 

UWK.494015 LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARGKSG 

UNK_4 94 0 18 LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARGKSG 

JDVLVS LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARGKSG 

JDVLVR LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSMARGKSG 

T13468 LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPGVRSQLKQSRLGLQPQQGSLARGKSG 

T13473 LQHGRLVFQTSTRHGDKSFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARGQSG 

S434 91 LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARGKAG 

S35527 LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARRNQG 

Prim . cons . LQHGRLVFQTSTRHGDESFCSQSSGILSRSPVGPCVRSQLKQSRLGLQPQQGSLARGQQG 

25 0 260 270 280 290 300 

I I I | I I 

JDVLKS RSGSIRARVHPSTRRCFGVEPSGSGHVDPSVNNSSSCLRQSAVRKAAYSHLSTSKRQSSS 

JDVLVD RSGSIRAKAHPSTRRYFGVEPSGSGH1DHSVNNSSSCLHQSAVRKAAYSHLSTSKRQSSS 

JDVLJ3 GSGSIRAGIHSTPWGTVGVEPSSSGHTHNCANSSSSCLHQSAVRKEAYSPVSTSKRHSSS 

S71785 RSWSIRAG1HPTARRPFGVEPSGSGHNTNLASKSASCIYQSPVRKAAYPAVSTFEKHSSS 

S474 06 RSWSIRAGIHPTARRPFGVEPSGSGHNTNLASKSASCLYQSPVRKAAYPAVSTFEKHSSS 

JDVLA1 RSWSIRAGZHPTARRPFGVEPSGSGHTTNLASKSASCLHQSPVRKATYPSVSTFEKHSSS 

S2 0 7 5 2 WSWS IRAGTHPTARRPFGVEPSGSGHTTHRASKSASCLYQSPDRKATYPSVSTFERHSSS 

S67505 RSGSIRARVHSTTRRSFRVELSGSGSNHNIASTSSSCRHQSAVRETAYSHLSTVERHSSS 

JQ22 2 9 RSGRLRSRVHTPTRWPAGVEPSSTRCVNNLASRSASCFHQSAVREKANPSLSTSKRHTST 

UNK_494016 

UNK_494017 

UNK_4 94 0 15 RSGS IWSRVHPTTRRPFGVEPSGSGHIDNTASSTSSCLHQSAVRKTAYSHLSTSKRQSSS 

UNK_494018 RSGSIWSRVHPTTRRPFGVEPSGSGHIDNTASSTSSCLHQSAVRKTAYSHLSTSKRQSSS 

JDVLVS RSGSIRARVPPTTRRSFGVEPSGSGHIDNRASSTSSCLHQSAVRKTAYSHLSTSKRQSSS 

JDVLVR RSGSIRARVHPTTRRSFGVEPSGSGHIDNSASSTSSCLHQSAVRKTAYSHLSTSKRQSSS 

T13 4 6 8 RSGS IRARVHPTTRRSFGVEPSGSGHIDNSARSASSCLHQSAVRKTAYSHLSTSKRQSSS 

T13473 RIGSIRARVHPTTRRSFGVEPSGSGHIDNSASSPSSCLHQSAVRKTAYCHLSTTKRQSSS 

S4 3 4 9 1 RSGS IRARVHPTTRRPFGVEPSGSGHIDNSASSASSCFHQSAVRKTAYSHLSTSKRQSSS 

S35527 RSGRLRARVHPTTRRSFGVEPSGSGHLDNSASSSSSCLHQSAVRKTAYSHLSTSKRQSSS 

Prim . cons . RSGSIRARVHPTTRRPFGVEPSGSGHIDNSASSSSSCLHQSAVRKTAYSHLSTSKRQSSS 
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31 ° 32 ° 3 30 340 350 360 

^™ GHAVEFHCLPPSSARPQSQGSVFSCWWLQFRMSKPCSEYCLSHLVNLREDRGPCDEHGEH 

jdvlvd ghavefhclppnsagsqsqgsvsscwwlqfpjjskpcseyclshl^pSwgpSSgS 
jdvlj3 ^avelhhvppnssrsqsqgsvlscwwlqprnskpcsehclfhi^Sgp^gS 

SEE g ^™ ppns ^ s q serpvsp ^^ 

c«lnl GR ^^ FPPNS ^ SQSERPIPPC ™ L ^^ 
Ifltli GHE ^ LYSIPPNS ^SQS T GPIL S CW W LQF M I S E^ 

2E GNA ^^ WPSSVGS Q GKGS ^ PG wwi^ 
™ W8SMS Q™»scini^^ 

UNK_494018 GHAVE 

G ^^^ ISPSPARSQSEGPIFSS ^Q F ^SIO^ 

G ^ MPC ^Q FRNS ^ G SDYCI/EmVNia^^ 

SSI? g ^ tc ™ lqfrns ^ csd *^hivnlle^ 

S43491 GHA ^HNIPPSSARSKSEGP L FPCWW L QF™ S KP^ 
S35527 GHA ™^ Pp SSARSQSEGPIFSCWWI,QF^^ 

Prim . cons . GHAVELHNIPPSSARSQSEGP2FSCWWLQFRNSKPCSDYCLSHIVNLLEDWGPCTEHGEH 

38 ° 390 400 410 420 

S5S =p= 

™!f HIRIPRTPAR ^ GG ^ L ™^^ 

odvlai hiriprtp ^^ g gvflvdknphnteesrl^ 

S20752 hiriprtparvtggvf L vdknphntaesrlvvdf S qfsr™^wpSa™ 

S67505 H1RIPRTP ^ V TGGVFLVDKNPHNTTESRL^ 

JQ2229 YIRTPRTPARVTGGVFLVDKNPHNTTESRL\A^FSQFSRGTTRV^ 

UNK.494016 NIRIPRTPARVTGGVFLVDKNPHKTTTESRLVVDFSQFS 

UNK_494018 _ _ 

Prisons. HIRIPETPMVTGGVFLVDKMPHirrTESPJ.VVDPSOPSRGSTEVSWPKPAVPMLOSLrai, 

43 ° "° «S0 460 470 480 

SS2 

JDVLA1 LSSNLSWLSI^VSA^YHLPI^PAAMPHLLVGSSGLSRYVARLSSDSRTF^oSrS 
S20752 MnmiBVaMFffl^^ 

LSS ^ S ^ SMVSAAF ^ PL HPAAMPHIiVGSSGLSRWAPiSSTSRSoS 
JQ2229 ^S^^SI^VSAAFYHLPI^AAMPHLLVGSSG^ 
TOTK_494016 MSNLSWLSLDVSAAFYHIPI^AAMPHLLVGSSGLPrI^^ 
UNK_494017 ^SM^I^SA^ 
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Prim . cons . 



LSSNLSV^SLDVSAAFYHIPLHPAAMPHLLVGSSGLPRYVARLS 
LSSNLSWLSLDVSAAFYHIPLHPAAMPHLLVGSSGLPRY\^^LSST 

L S SNLS WLSLDVS AAFYH I PLHPAAMPHLLVGS SGLPR YVAR^s S TSRNXJ^OHr ^MnnT* 
L S SNLS WLSLDVS AAFYHI PLHPAAMPHLLVGS S GLQRYVARLS STS^I^OHG^IOn^ 



LS SNLS WLSLDVS AAFYHI PLHPAAMPHLLVGS SGLSR YVARLSSTSRNTNHQ] 
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IHGTMQNL 
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OTYCSRNLYVSIiLLYQTFGRKI^YSHPIILGFRKlPMGVGLSPFLIAQPTSMC^ 
^of^ LWSLMLLYQOTGW ™ YSHPI ^ GF ^"MGVGLSPF L SQp?S 

^hcsrnlfvsi^llyktfgrklhlyshpivlgfrkipmgvgls^ 

™o^™ SLLLLFQTLGR ^^^ 

HDSCSFJnVYVSLLLLYKTFGRKLHLYSHPIILGFRKIPMGVGL^ 



roSCSPJ^YVSLr,LLYKTFGRkLHLYSHPIILGFRKIPMGGGr,SPFLrAQFTSAICS^ 

S^f^ WSLLLLYOTFGR ™ Ys ^ 

HDSCSRNLYVSLLLLYKTFGRKLHLYSHPIILGFRKIPMGVGLSPFLLAQF 
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VIGSWGTLPQDHIVQKIKHCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
IIGSWGTLPQDHIVQKIKHCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVLKIKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSYGSLPQDHIIQKIKECFRKLPINRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGCYGSLPQDHIIQKIKECFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALKPLY 
VIGCYGSLPQDHIIQKIKECFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGCYGSLPQEHIIQKIKECFRKVPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
IIGSWGSLPQDHIVQKIKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGSLPQDHIVHKIKECFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVQKLKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVQKLKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 



VIGSWGTLPQEHIVLKIKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVLKLKQCFRKLPVNSPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVQKIKQCFRKLPVNRPIDWKVCQXIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQDHIVQKLKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVQKLKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
VIGSWGTLPQEHIVLKLKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 

VIGSWGTLPQEHIVQKIKQCFRKLPVNRPIDWKVCQRIVGLLGFAAPFTQCGYPALMPLY 
670 680 690 



700 



710 



720 



ACIQAKQAFTFSPTYKAFLSKQYMNLYPVARQRPGLCQVFADATPTGWGLAIGHQRMRGT 
AC I QAKQAFTFS PT YKAFL S KQYMNL YPVARQRPGLCQVFADATPTGWGLAIGHQRMRGT 
AC IQAKQAFTFS PTYKAFLNKQYLNLYPVARQRPGLCQVPADATPTGWGLAI GHQRMRGT 
ACIQSKQAFTFSPTYKAFLCKQYLNLYPVARQRPGLCQVFADATPTGWGLVMGHQRMRGT 
AC I QS KQAFTFS PTYKAFLCKQ YLNLYPVARQRPGLCQVFADATPTGWGLVMGHQRMRGT 
ACIQSKQAFTFSPTYKAFLCKQYLNLYPVARQRPGLCQVFADATPTGWGLVMGHQRMRGT 
ACIQFKQAFTFSPTYKAFLCKQYLNLYPVARQRPGLCQVFADATPTGWGLGMGHQRMRGT 
AC I QAKQAFTFS PTYKAFLRTQYLTLYPVARQRPGLCQVFADATPTGWGLAI GHQRMRGT 
AC ITAKQAFVFS PTYKAFLCKQYMNLYP VARQRPGLCQVFADATPTGWGLAI GHQRMRGT 
ACIQSKQAFTFSPTYKAFLCKQYLNLYPVARQRSGLCQVFADATPTGWGLAIGHRRMRGT 
ACIQSKQAFTFSPTYKAFLCKQYLNLYPVARQRSGLCQVFADATPTGWGLAIGHRRMRGT 



AC I QS KQAFTFS PTYKAFLCKQ YLHLYPVARRTALCQVFADATPTGWGLAI GHRRMRGTF 
AC I QSKQAFTFS PTYKAFLCKQYLNL YPVARQRS GLCQVFADATPTGWGLAIGHRRMRGT 
AC I QAKQAFTFS PTYKAFLCKQ YLHLYPVARQRSGLCQVFADATPTGWGLAI GHRRMRGT 
ACIQSKQAFTFSPTYKAFLCKQYLNLYPVARQRSGLCQVFADATPTGWGLAIGHRRMRGT 
AC I QS KQAFTFS PTYKAFLCKQ YLNLYPVARQRSGLCQVFADATPTGWGLAIGHQRMRGT 
ACIQSKQAFTFSPTYKAFLCQQYLHLYPVARQRSGLCQVFGDATPTGWGLAIGHRRMRGT 

AC I QS KQAFTFS PTYKAFLCKQ YLNLYPVARQRPGLCQVFADATPTGWGLAIGHQRMRGT 



730 



740 



760 



770 



780 



750 

I I I , , , 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCAANWILRGTSFV 
FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCTANWILRGTSFV 
FVS PLPIHTVELLAACFARSRSGAKL IGTDNS WLSRKYTS FPWLLGCAANWILRGTS FV 

FLAPLPIHTAELLAACFARSRSGANILGTDNSVVLSRKYTSFPWLLGCAANWILRGTSFV 
FLAPLPIHTAELLAACFARSRSGANILGTDNSVVLSRKYTSFPWLLGCAANWILRGTSFV 
FQAPLPIHTAELLAACFARSRSGANILGTDNSVVLSRKYTSFPW 

FSAPLPIHTAELLAACFARSRSGANILGTDNSVVLSRKYTS FPWLLGCAANWILRGTS FV 
FVAPLPIHTAEIiIJ^CFARSRSGANIIGTDNSVVLSPKYTSFPWLLGCAANWILRRTSFV 
FVAPLPIHTAELLAACFARSRSGATL IGTDNS WLSRKYTS FPWLLGCAANWILRGTS FV 
FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYT^^ 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCAANWILRGTSFV 
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JDVLVS 
JDVLVR 
T13468 
T13473 
S43491 
S35527 

Prim . cons . 
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JDVLVD 

JDVLJ3 

S71785 

S47406 

JDVLA1 

S20752 

S67505 

JQ22 29 

UNK_4 94 016 

UNK_494 017 

UNK_494015 
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S43491 

S35527 

Prim. cons . 



VAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCAANWILRGTYFW 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCAANV7ILRGTSFV 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCAANWILRGTSFV 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRK^TSFPWLLGCAANV? 

FVA^IjP IHTAELLAAC FARSRS GATL IGTDNS WLSRKYTS FP WLLGCAANW I LRGTS FV 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVV^ 

FVAPLPIHTAELLAACFARSRSGAKLIGTDNSVVLSRKYTSFPWLLGCAANWILRGTSFV 
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800 810 820 

I I I I ! j 

YVPSALNPADDPSRGRLGLSRPLLRLPFQPTTGRTSLYAVSPSVPSHLPVRVHFASPLHV 
YVPSALNPADDPSRGRLGLSRPLLRLPFQPTTGRTSLYAVSPSVPSHLPVRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLRLPYRPTTGRTSLYADSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGIFRPLLRLPFRPTTGRTSLYADSPSVPSHLPVRVHFASPLHV 
YVPSALNPADDPSRGRLGLSRPLLRLPFRPTTGRTSLYADSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLSRPLLRLPFRPTTGRTSLYADSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLSRPLLCLPFRPTTGRTSLYADSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLRPWFRPTTGRTSLYAVSPSVPSHLPVRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLRLPFQPTTGRTSLYADSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLHLPFRPTTGRTSLYAVSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLHLPFRPTTGRTSLYAVSPSVPSHLPDRVHFASPLHV 



VPSALNPADDPSRGRLGLIRPLLHLRFRPTTGRTSLYAVSPSVPSHLPDRVHFASPLHVA 
YVPSALNPADDPSRGRLGLYRPLLLLPFRPTTGRTSLYAVSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLHLPFRPTTGRTSLYAVSPSVPSfflliPDRVHFASPLHV 
YVPS ALNPADDPSRGRLGL YRPLLHLPFRPTTGRTSL YAVS PS VPSHLPDRVHFAS PLHV 
YVPSALNPADDPSRGRLGLYRPLLRLSFRPTTGRTSLYAVSPSVPSHLPDRVHFASPLHV 
YVPSALNPADDPSRGRLGLYRPLLHLPFQPTTGRTSLYAVSPSVPSHLPVRVHFASPLHV 

YVPSALNPADDPSRGRLGLYRPLLRLPFRPTTGRTSLYAVS PS VPSHLPDRVHFAS PLHV 
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UNK_494016 


AWRPP 
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AWRPP 


UNK_494 015 
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CLUSTALW alignment of 19 HBV polymerare sequences representing the sybtypes adw (4), ayw (5), 
ayr (4) and adr (6) (NPS@: Network Protein Sequence Analysis, UBS Vol. 25, No 3 (291): 147-1 50, ' 
Combet C, Blanchet C, Geourjon C. and Deleage G. (March 2000)) 
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CLUSTALW options used : 

endgaps=l 

gapdist=8 

gapext=0 . 2 

gapopen=10 . 0 

hgapresidues=GPSNDQERK 

ktuple=l 

matrix=gonnet 

maxdiv=3 0 

out order =al igned 

pairgap=3 

score=percent 

topdiags=5 

type=PROTEIN 

window=5 
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Table 22. HCV Multiple Sequence Alignment 
GCG Multiple Sequence File. 
Written by Omiga l.l 



Name: BEBE1 Len 

Name: D89815 Len 

Name: ED43type_4 Len 

Name: HC_C2 Len 

Name: HCJ39 Len 

Name: HCU16326 Len 

Name: HCV_H_CMR Len 

Name : HCV_JI Len 

Name : HCV_J4 8 3 Len 

Name : HCV_J8 Len 

Name: HCV_JK1 Len 

Name : HCV__JS Len 

Name: HCV_K1_R1 Len 

Name: HCV_K1_R2 Len 

Name: HCV_JC1_R3 Len 

Name: HCV_K1_S1 Len 

Name : HCV_K1_S2 Len 

Name: HCVJK1_S3 Len 

Name : HCV_L2 Len 

Name: HCV_N Len 

Name : HCV1 2083 Len 

Name: HCV1480 Len 

Name: HCVPOLYP Len 

Name : HD_1 Len 

Name : HPCCGAA Len 

Name: HPCFG Len 

Name: HPCGENANTI Len 

Name : HPCGENOM Len 

Name: HPCHUMR Len 

Name : HPC J Len 

Name: HPCJCG Len 

Name: HPCJK046 Len 

Name: HPCJK049 Len 

Name: HPCJTA Len 

Name: HPCJTB Len 

Name: HPCK3A Len 

Name: HPCPLYPRE Len 

Name: HPCPOLP Len 

Name : HPCPP Len 

Name: HPCUNKCD Len 

Name: MKC1A Len 

Name: NDM59 Len 

Name : NZLI Len 

Name : SA13 Len 

Name : Th5 8 0 Len 

Name: Type_3a_CB Len 

Name : TypeV_D Len 

Name : VN0 0 4 Len 

Name: VN235 Len 

Name : VN4 0 5 Len 



3052 


Check : 


£» O \J 3 


Weight: 1.00 


3052 


Check : 




Weight 


: 1.00 


3052 


Check : 


4987 


Weight 


: 1.00 


3052 


Check : 


6273 


Weight 


: 1. 00 


3052 


Check : 


217 


Weight : 


1.00 


3052 


Check • 


5854 


Weight 


: 1.00 


3052 




4932 


Weight 


: 1. 00 


3052 




4947 


Weight 


: 1.00 


3052 




2553 


Weight 


: 1.00 


3052 


Check : 


9778 


Weight 


: 1.00 


3052 


Check : 


4917 


Weight 


: 1.00 


3052 


Chefk" • 


3982 


Weight 


: 1.00 


3 052 




9084 


Weight 


: 1.00 


3052 




47 Weight: 


1.00 


3052 


Check : 


1630 


Weight 


: 1.00 


3052 


Check : 


3578 


Weight 


: 1.00 


3052 


Check • 


9909 


Weight 


: 1.00 


3052 


Check : 


9508 


Weight 


: 1.00 


3052 


Check : 


4175 


Weight 


: 1.00 


3052 


Check : 


1702 


Weight 


: 1.00 


3052 




7564 


Weight 


: 1.00 


.3052 


Check ; 


.5620 


.Weight 


: . 1.00 


3052 


Check* - 


2663 


Weight 


: 1.00 


3052 


Check* • 


4040 


Weight 


: 1.00 


3052 


Check : 


5414 


Weight 


: 1.00 


3052 


Check : 


7119 


Weight 


: 1.00 


3052 


Check : 


9591 


Weight 


: 1.00 


3052 


Check* • 


2009 


Weight 


: 1.00 


3052 


Check- • 


4863 


Weight 


1.00 


3 052 




3553 


Weight 


1.00 


3052 




6658 


Weight 


1.00 


3052 


Check* • 


436 


Weight : 


1.00 


3052 


Check • 


9796 


Weight : 


1.00 


3052 




2902 


Weight : 


1.00 


3052 


Checlr * 


4237 


Weight : 


1.00 


3052 




2180 


Weight : 


1.00 


3052 




6557 


Weight : 


1.00 


3052 


Check : 


1218 


Weight: 


1 . 00 


3052 


Check: 


3845 


Weight : 


1.00 


3052 


Check: 


6214 


Weight : 


1.00 


3052 


Check : 


1615 


Weight : 


1.00 


3052 


Check: 


9717 


Weight : 


1.00 


3052 


Check: 


2255 


Weight : 


1.00 


3052 


Check: 


9158 


Weight : 


1.00 


3052 


Check: 


37 Weight: 


1.00 


3052 


Check : 


7958 


Weight : 


1.00 


3052 


Check : 


303 Weight: 


1. 00 


3052 


Check : 


3873 


Weight : 


1.00 


3052 


Check : 


7756 


Weight: 


1.00 


3052 


Check: 


7249 


Weight : 


1.00 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC G9 



MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR 
MS TNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR 
MSTNPKPQRK TKRNTNRRPM DVKFPGGGQI VGGVYLLPRR 
MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR 
MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR 



50 

GPRLGVRAAR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRVGVRATR 



387 



WO 2005/012502 



PCTAJS2004/009510 



HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_jJ4 83 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1__R1 
HCV_K1_R2 
HCV_K1_R3 
HCV_K1_S1 
HCV_K1_S2 
HCVJC1_S3 
HCV_L2 
HCVJ6J 
HCV12083 
HCV148 0 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJR046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a_CB 
TypeV_D 
VN004 
VN235 
VN405 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC_G9 
HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_J483 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1_R1 
HCV Kl R2 



MSTNPKPQRK 
MSTNPKPQRK 
MSTIPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRQ 
MSTNPKPQRK 
MSTNPKPQRK 
MSTLPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTLPKPKRQ 
MSTNGKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRQ 
MSTLPKPQRI 
MSTNPKPQRK 
MSTNPKPQRK 
MSTLPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTNPKPQRK 
MSTLPKPQRK 
MSTNPKPQRK 
MSTLPKPQRK 
MSTLPKPQRK 
MSTLPKPQRK 
MSTLPKPQRK 
MSTLPKPQKR 
MSTLPKPQRK 

51 

KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 



TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPM 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTLRRPK 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNTNRRPQ 
TKRNINRRPQ 
TKRNTYRRPQ 
TKRNTYRRPQ 
TKRNTIRRPQ 
NKRNTNRRPQ 
TKRNTNRRPQ 
IKRNTNRRPQ 
TKRNTNRRPQ 
IKRNTNRRPQ 
TKRNTSRRPQ 
TKRNTIRRPQ 
TKRNTNRRPQ 
TKRNTNRRPM 
TKRNTIRRPQ 
TKRNTIRRPQ 
TKRNTNRRPM 
NQRNTNRRPQ 
TKRNTNRRPM 



RRQPIPKDRR 
RRQPI PKARR 
RRQPIPKARR 
RRQPI PKARR 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPKVRR 
WRQPIPKARR 
RRQPIPKDRR 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPKARQ 



DIKFPGGGQI 
DVEFPGGGQI 
DVKFPGGGQ I 
DVKFPGGGQI 
DVKFPGGGQ I 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
EVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
NVKFPAGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGVI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DIKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 
DVKFPGGGQI 



STGKSWGRPG 
PEGRTWAQPG 
PEGRSWAQPG 
PEGRAWAQPG 
PEGRSWAQPG 
PEGRAWAQPG 
PEGRTWAQPG 
PEGRTWAQPG 
PEGRAWAQPG 
STGKSWGKPG 
PEGRAWAQPG 
PEGRTWAQPG 
PEGRAWAQPG 
PEGRAWAQPG 



VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGEVYVLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYVLPRR 
VGGVYVLPRR 
VGGVYVLPRR 
YVGVYVLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYVLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYVLPRR 
VGGVYVLPRR 
VGGVYLLPRR 
VGGVYLLPRR 
VGGVYLLPRR 



YPWPLYRNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 



GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRAIR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRALR 
GPRLGVRATR 
GPRLGVRATR 
GPQLGVREVR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRAPR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPKLGVRAVR 
GPTLGVRATR 
GPTLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVCATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 
GPRLGVRATR 

100 

LGWAGWLLSP 
LGWAGWLLSP 
CGWAGWLLSP 
MGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
CGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
MGWAGWLLSP 
LGWAGWLLSP 
MGWAGWLLSP 



388 



WO 2005/012502 



PCT/US2004/009510 



HCV_K1_R3 
HCV_K1_S1 
HCV_K1_S2 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12083 
HCV14 80 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a_CB 
TypeVJD 
VN004 
VN235 
VN4 05 



BEBE1 
D89815 
ED43type_4 
HC__C2 
HCJ39 
HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_J4 83 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1_R1 
HCV_K1_R2 
HCVJEC1_R3 ■ 
HCV_K1_S1 
HCV_K1_S2 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12 083 
HCV1480 
HCVPOLYP 



KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KNSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTWERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRS 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERPQPRG 
KTSERSQPRG 
KTSERPQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRR 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 
KTSERSQPRG 

101 

RGSRPSWGPS 
RGSRPSWGP1J 
RGSRPSWGPN 
RGSRPSWGPN 
RGSRPSWGPS 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPTWGPT 
YGSRPRWGPT 
RGSRPNWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRSSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRSNWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPHWGPN 
RSSRPNWGPN 
RGSRPSWGPS 



RRQPIPKVRR 
RRQPIPKARR 
RRQPIPKARQ 
RRQPIPKVRR 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARR 
RRQLIPKARQ 
RRQPIPKARR 
RRQPTPKARP 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPKARR 
RRQPIPRARR 
RRQPIPKARR 
RRQPIPKARR 
RRKP I PKARR 
RRQPIPKARR 
RRQPIPKDRR 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARQ 
RRQPIPKDRR 
RRQPIPKARR 
RRQPIPKARQ 
RRQPIPKARP 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARQ 
RRQPIPKARR 
RRQPIPKARQ 



DPRHKSRNLG 
DPRRRSRNLG 
DPRGRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRKSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRHRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRKSPNLG 
DPRRRSRNLG 



SEGRTWAQPG 
PEGRAWAQPG 
PEGRAWAQPG 
SEGRTWAQPG 
PEGRAWAQPG 
PEGRAWAQPG 
PQGRHWAQPG 
PTGRSWGQPG 
PEGRAWAQPG 
PEGRSWAQPG 
PEGRTWAQPG 
REGRSWAQPG 
PEGRAWAQPG 
PEGRTWAQPG 
PEGRTWAQPG 
PEGRTWAQPG 
PEGRTWAQPG 
QTGRAWGQPG 
TEGRSWAQPG 
PEGRAWAQPG 
PEGRAWAQPG 
SEGRSWAQPG 
PEGRTWAQPG 
STGKSWGKPG 
PEGRAWAQPG 
PEGRAWAQPG 
PEGRAWAQPG 
STGKSWGKPG 
SEGRSWAQPG 
PTGRSWGQPG 
SQGRTWGQPG 
SGGRSWAQPG 
SEGRSWAQPG 
PIGRSWGQPG 
QTGRTWAQPG 
SQGRHWAQPG 



KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
RVIDTITCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTPTCGF 
KVIDTLTCGL 
KVIDTLTCGF 
RVIHTLTCGF 
KVIDTLTCGF 



YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYANEG 
YPWPLYGNEG 
YPWPLYGSEG 
YPWPLYANEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YAWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYANEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 
YPWPLYGNEG 



ADLMGYIPW 
ADLMG Y I PL V 
ADLMGYIPLV 
ADLMG Y I PL V 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPW 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPW 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPW 
ADLMGYIPLV 
ADLMGYVPLV 
ADLMWYI PW 
PHLMGYI PLV 
ADLMGYIPLV 



LGWAGWLLSP 
LGWAGWLLSP 
MGWAGWLLSP 
LGWAGWLLSP 
LGWAGWLLSP 
MGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
MGWAGWLLSP 
MGWAGWLLSP 
CGWAGWLLSP 
CGWAGWLLPP 
LGWAGWLVSP 
FGWAGWLLSP 
LGWAGWLLSP 
MGWAGWLLSP 
MGWAGWLLSP 
CGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
LGWAGWLLSP 
CGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
LGWAGWLLSP 
LGWAGWLLSP 
LGWAGWLLSP 
LGWAGWLLSP 
CGWAGWLLSP 
LGWAGWLLSP 
CGWAGWLMSP 
CGWAGWLLSP 
CGWAGWLLSP 
CGWAGWLLSP 
CGWMGWLLSP 
CGWAGWLLSP 

150 

GAPVGGVARA 
GAPLGGAARA 
GAPVGSVARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGVARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGAARA 
GAPVGGVARA 
GAPLGGAARA 
GAPLGGVARA 
GAPLGGAARA 
GAPLGGASRA 
GPPLGGVARA 
GAPLGGTARA 
GAPLGGASRA 
GPPLGGVARA 
GAPLGGVARA 
GGPLGGAARA 
GAPLGGVAAA 
GGPVGGVSRA 
GAPLGGAARA 



389 



WO 2005/012502 



PCT/US2004/009510 



HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a_CB 
TypeV_D 
VN004 
VN2.35 
VN405 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC_G9 
HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_J483 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1_R1 
HCV_K1_R2 
HCV_K1JR3 
HCV_K1_S1 
HCV_K1_S2 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12083 
HCV1480 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 



RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGQN 
RGSRPNWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPTWGPN 
RGSRPSWGPN 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPNWAPN 
RGSRPSWGPT 
RGSRPSWGPN 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPT 
RGSRPSWGPN 
RGSRPNWGPN 
RGSRPSWGPN 
RGSRPSWGPN 
RGSRPSWGQN 
RGSRPNWGPN 
RGSRPHWGPN 
RGSRPNWGPN 

151 

LAHGVRVLED 
LAHGVRVLED 
LAHGVRALED 
LAHGVRALED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLEV 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLEV 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRAI ED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLEG 
LAHGVRVLED 
LAHGVRALED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRAVED 



DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRHRSRNVG 
DPRRRSRNLG 
DPRRKSRNLG 
DPRRRSRNLG 
DPRHRSRNVG 
DPRRRSRNLG 
DPRRKSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 
DPRRRSRNLG 



KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGL 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGL 
KVIDTLTCGF 
KVIDTLTCGF 
KVIDTLTCGL 
KVIDTLTCGF 
KVIDTLTCGF 



ADLMGYI PL V 
ADLMGY I PL V 
ADLMGYI PL I 
ADLMGYI PL V 
ADLMGYI PLV 
ADLMGY I PLV 
ADLMGYI PLV 
ADLMGYI PLV 
ADLMGY I PVI 
ADLMGYIPLV 
ADLMGY I PLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYI PW 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGY I PW 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGY I PW 
ADLMGYIPLV 
ADLMGYIPLV 
ADLMGY I P VL 
ADLMGYI PW 
ADLMGYI PW 



GINYATGNLP 
GVNYATGNLP 
GINYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GINYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
SVNYATGNLP 
GVNYATGNMP 
GINYATGNLP 
GINYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 



GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCPFSIFVLA 
GCSFSIFLLG 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 



LLSCISVPVS 

LLSCLTIPAS 

LLSCLTVPAS 

LLSCLTIPAS 

LLSCLTVPAS 

LLSCLTTPVS 

LLSCLTVPAS 

LLSCLTVPAS 

LLSCLTIPAS 

LLSCVTVPVS 

LLSCLTVPVS 

LLSCLTTPAS 

LLSCLTIPAS 

LLSCLTIPAS 

LLSCLTIPAS 

LLPCLTIPAS 

LLSCLTIPAS 

LLSCLTIPAS 

LLSCLTVPAS 

LLSCLTVPAS 

LLSCLTTPAS 

LLWCLTVPAS 

LLSCLTIPAS 

LLSCLTIPAS 

LLSCLTVPAS 

LFSCLTCPAS 

LLSCLTIPAS 

LLSCLTTPAS 

LLSCLTTPAS 

LLSCLTVPVS 

LLSCLTIPAS 

LLSCLTVPAS 



GAPLGGAARA 
GAPLGGAARA 
GAPVGGVARA 
GAP LG G VARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGAARA 
GGPLGGVAAA 
GAPVGGVARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGAARA 
GAPLGGVARA 
GAPLGGAARA 
GAPLGGVARA 
GAPLGGAARA 
GAPLGGVARA 
GAPVGGVARA 
GGPVGGVARA 
GGPLGGVAAA 
GAPVGGVARA 
GAPVGGVARA 
GGPLGGVAAA 
GAPLGGVAAA 
GAPLGGVAAA 

200 

AVEVRNTSSS 

AYEVRNVSGI 

AVNYRNVSGI 

AYE VRNAS G V 

AVGVRNSSGV 

AYEVRNASGM 

AYQVRNSSGL 

AYQVRNS TGL 

AYEVRNVSGI 

AVEVRNISSS 

TYEVRNVSGV 

AYEVRNVSGI 

AYEVCNASGL 

AYEVRNASGV 

AYEVRNVSGV 

AYEVCNASGL 

AYEVRNASGV 

AYEVRNVSGV 

AYEVRNVSGI 

AHE VRNAS GV 

ALTYGNS SGL 

AVPYRNASGV 

AYEVRNVSGV 

AYEVRNVSGV 

AYQVRNSSGL 

SLEYRNASGL 

AYEVHNVSG I 

AYEVRNVSGI 

AYE VHNVS GI 

AYEVRNVSGG 

AYEVRNVSGI 

AVNYANKSGI 



390 



WO 2005/012502 



PCT/US2004/009510 



HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3 a_CB 
TypeV_D 
VN004 
VN235 
VN4 05 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC_G9 
HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_J483 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1__R1 
HCV_K1_R2 
HCV_K1_R3 
HCV_K1_S1 
HCV_K1J52 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12083 
HCV14 80 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 



LAHGVRALED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRALED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRVLED 
LAHGVRALED 
LAHGVRVLED 
LAHGVRAIED 
LAHGVRALED 
LAHGVRALED 
LAHGVRAIED 
LAHGVRAVED 
LAHGVRAIED 

201 

YMATNDCSNS 
YHVTNDCSNS 
YHVTNDCPNS 
YHVTNDCSNS 
YHVTNDCPNA 
YHVTNDCSNS 
YHVTNDCPNS 
YHVTNDCPNS 
YHVTNDCSNS 
YYATND CSNN 
YHVTNDCSNS 
YHVTNDCSNS 
YHVTNDCSNS 
YHVTNDCSNA 
YHVTNDCSNS 
YHVTNDCSNS 
YHVTNDCSNA 
YHVTNDCSNS 
YHVTNDCSNS 
YHVTNDCSNS 
YHLTNDCSNS 
YHVTNDCPNS 
YHVTNDCSNA 
YHVTNDCSNS 
YHVTNDCPNS 
YLLTNDCSNR 
YHVTNDCSNS 
YHVTNDCSNS 
YHVTNDCSNA 
YHVTNDCSNS 
YHVTNDCSNS 
YHLTNDCPNS 
YTVTNDCSNG 
YHVTNDCSNS 
YHVTNDCSNS 
YVLTNDCSNS 
YHVTNDCPNS 
YMVTNDCTND 
YHVTNDCSNS 
YHVTNDCSNS 
YHVTNDCSNS 



GINFATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GINFATGNLP 
GVNYATGNLP 
GVNFATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GVNYATGNLP 
GINFATGNLP 
GVNYATGNLP 
GINYATGNLP 
GINFATGNLP 
GINFATGNLP 
GVNYATGNLP 
GINYATGNLP 
GINYATGNLP 



SIVWQLEGAV 
SIVYEAADVI 
S IVYEADHHI 
SIVYEAADMI 
S WYETENL I 
SIVYEAADMI 
S I VYEAADAI 
SIVYEAHDAI 
SIVYEAADMI 
S I TWQLTDAV 
SIVYEAADMI 
SIVYEAADII 
SIVYEAQDMI 
SIVYEAADMI 
SIVYEAEDVI 
S I VYEAEDMI 
SIVYEAADMI 
SIVYEAEDVI 
S I VYEAADL I 
S I VFEAADL I 
SIVLEADAMI 
S I VYEADNL I 
SIVYEAADMI 
SIVYETADMI 
SWYEAADAI 
SIVYEADDVI 
SIVYEAADMI 
S I VYEAADL I 
SI VYEAADL I 
SIVYEAADVI 
SIVYEAADMI 
SMVYEAEAII 
SIVYEAGDVI 
S I VYEAAGM I 
SIVYEAAGMI 
SIVYEADDVI 
SIVYEAADAI 
SITWQLQAAV 
SIVYEAADVI 
SIVYEAADMI 
SIVYEAADVI 



GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFILA 
GCSFSIFILA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 
GCSFSIFLLA 



LHTPGCVPCE 
MHAPGCVPCV 
MHLPGCVPCV 
MHNPGCVPCV 
MHLPGCVPYV 
MHTPGCVPCV 
LHTPGCVPCV 
LHTPGCVPCV 
MHTPGCVPCV 
LHLPGCVPCE 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCLPCV 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCLPCV 
MHTPGCVPCV 
MRTPGCVPCV 
LHLPGCLPCV 
LHAPGCVPCV 
MHVPGCVPCV 
MHTPGCVPCV 
LHTPGCVPCV 
LHLPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 
LHLPGCVPCI 
LHLPGCIPCV 
MHTPGCVPCV 
MHTPGCVPCV 
LHTPGCIPCV 
LHTPGCVPCV 
LHVPGCVPCE 
MHTPGCVPCV 
MHTPGCVPCV 
MHTPGCVPCV 



LLSCLLTPTA 
LLSCLTIPAS 
LLSCLTIPAS 
LFSCLIHPAA 
LLSCLTVPAS 
LLSCITTPVS 
LLSCLTIPAS 
LLSCLTTPVS 
LLSCLTIPAS 
LLSCITVPVS 
LFSCLIHPAA 
LLSCLTVPTS 
LLSCLTTPAS 
LFSCLIHPAA 
LFSCLIHPAA 
LLSCLTTPAS 
LLSCLTTPAS 
LLSCLTTPAS 



KTGN.KSRCW 
RENN.SSRCW 
REGN . QSRCW 
RENN.SSRCW 
REGN . ASRCW 
REDN.SSRCW 
REGN . ASRCW 
REGN.VSRCW 
REDN.SSRCW 
NDNG . TLHCW 
REGN. SSRCW 
REKtT. ISRCW 
RENN.SSRCW 
REAN.SSRCW 
RENN.SSRCW 
RENN.SSRCW 
REAN.SSRCW 
RENN.SSRCW 
REAN.SSRCW 
REGN. S SRC W 
RVGN.QSTCW 
LEDN.VSRCW 
RVDN.SSRCW 
REDN.SSRCW 
REGN. ASRCW 
ETDNNNTSCW 
RENN . SSRCW 
REGN.SSRCW 
REGN. SSRCW 
RENN.SSRCW 
RESN.FSRCW 
RTGN . QSRCW 
RLNN.ASKCW 
RENN. ASRCW 
RENN.VSRCW 
QDGN . TSTCW 
REGN. ASRCW 
KVGN . TSRCW 
RENN.SSRCW 
REDN.SSRCW 
RENN.SSRCW 



GLEYRNASGL 
AYQVRNASGL 
AYQVRNRSGL 
SLEWRNTSGL 
AYQVRNSTGL 
AAEVKNISTG 
AYQVRNASGV 
AYEVRNASGM 
AYQVRNASGV 
AVQVKNISDS 
SLEWRNTSGL 
AVPYRNASGV 
ALTYGNS SGL 
SLEWRNTSGL 
SLEWRNTSGL 
AI QVRNASG I 
AVHYANKSGI 
AVHYRNISGI 

250 

VPVTPNIAIN 
VALTPTLAAR 
VALTPTVAAP 
VALTPTLAAR 
VSLSPTVAAR 
VALTPTLAAR 
VAVTPTVATR" 
VAMTPTVATR 
VALTPTLAAR 
IQVTPNVAVK 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAGR 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAGR 
VALTPTLAAR 
VALTPTLAAR 
HAVS PTLATP 
VQITPTLSAP 
VALTPTLAAR 
VALTPTLAAR 
VAVTPTVATR 
TPISPTVAVK 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAAR 
TPATPTLAIP 
TPVSPTVAVS 
VALTPTLAAR 
VALTPTLAAR 
TPVTPTVAVR 
VAMTPTVATR 
IPVSPNVAVQ 
VALTPTLAAR 
VALTPTLAAR 
VALTPTLAAR 



391 



WO 2005/012502 



PCT/US2004/009510 



NDM59 
NZLI 
SA13 
Th580 
Type_3 a__CB 
TypeV_D 
VN004 
VN235 
VN405 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC_G9 
HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_J483 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1_R1 
HCVJEC1JR2 
HCV_K1_R3 
HCVJC1_S1 
HCV_K1_S2 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV120 83 
HCV14 80 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HP CGENANT I 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCTNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a__CB 
TypeV^D 
VN004 
VN235 
VN405 



YMVTNDCSND 
YVLTNDCSNS 
YHVTNDCPNS 
YHLTNDCPRS 
YVLTNDCSNS 
YVLTNDCSNS 
YHLTNDCSNN 
YHLTNDCPNS 
YHLTNDCPNS 

251 

QPGALTKGLR 
NASVPTTTLR 
YIGAPLESLR 
NASVPTTTIR 
DSRVPVSEVR 
NASVPTTTLR 
DGKLPTTQLR 
DGKLPATQLR 
NASVPTTTIR 
HRGALTRSLR 
NSSIPTTTIR 
NISVPTATIR 
NASIPTTTIR 
NSSVPTTTIR 
NSSIPTTTIR 
NASIPTTTIR 
NSSVPTTTIR 
NSSIPTTTIR 
DSSIPTATIR 
NAT I HTTTIR 
NAS TPATGFR 
SFGAVTALLR 
NASVPTTAIR 
NASIPTTTIR 
DGKLPTTQLR 
HPGVTTASIR 
NNSVPTATIR 
NATI PTATVR 
NVTIPTTTIR 
NTTVPTTTIR 
NSSIPTTTIR 
NSTVPASGFR 
RPGAATASLR 
NTSIPTTTIR 
NTSIPTTTIR 
YVGATTASIR 
DGKLPATQLR 
QPGALTQGLR 
NSSIPTTTIR 
NASVPTTTLR 
NSSIPTTTIR 
RPGALTQGLR 
YVGATTASIR 
SLGAVTAPLR 
NASVPASGFR 
YVGATTASIR 
YVGATTASIR 
NSSVPIHGFR 
NAS I PVRGFR 
NAS VP IRGFR 



SITWQLQAAV 
SIVYEADDVI 
SIVYEAEDLI 
SIVLEAEAMI 
SIVYEADDVI 
SIVYEADDVI 
SIVFEAETII 
SIVYEAEDFI 
SIIYEADNII 



LHVPGCVPCE 
LHTPGCVPCV 
LHAPGCVPCV 
LHLAGCVPCV 
LHTPGCVPCV 
LHTPGCVPCV 
LHLPGCVPCI 
MHLPGCVPCI 
MHTPGCVPCV 



KMGN, 
QDGN. 
RQGN. 
RAGN, 
QNDN, 
QDGN. 
KVGN. 
KSGN, 
KTGN. 



ISRCW 
TSTCW 
VSRCW 
ISRCW 
ISTCW 
TSTCW 
GSRCW 
GSSCW 
KSQCW 



AHIDVIVMSA 
RHVDLLVGTA 
SHVDLMVGAA 
RHVDLLVGAA 
RRVDSIVGAA 
RHVDLLVGVA 
RHIDLLVGSA 
RHIDLLVGSA 
RHVDLLVGAA 
THVDMIVMAA 
RHVDLLVGAA 
RHVDLLVGTA 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
HHVDLLVGAA 
RHVDLLAGAA 
RAVDYLAGGA 
RHVDLLVGAA 
RHVDLLVGAA 
RHIDLLVGSA 
NHVNMLVAPP 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
RHVDLLVGTA 
RHVDLLVGAA 
QHIDLMVGAA 
THVDMMVGAA 
RHVDLLVGAA 
RHVDLLVGAA 
SHVDLLVGAG 
RHIDLLVGSA 
THIDMWMSA 
RHVDLLVGAA 
RHVDLLVGVA 
RHVDLLVGAA 
AH IDMWMS A 
SHVDLLVGAA 
RAVDYLAGGA 
KHVDLLAGAA 
SHVDLLVGAA 
SHVDLLVGAA 
RHVDLLVGAA 
RHVDLMVGAA 
SHVDLLVGSA 



TLCSALYVGD 
AFCSAMYVGD 
TVCSGLYIGD 
ALCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
TLCSALYVGD 
TLCSALYVGD 
AFCSAMYVGD 
TACSALYVGD 
ALCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
ALCSAMYVGD 
WCSSLYIGD 
AFCSALYVGD 
TFCSAMYVGD 
AFCSAMYVGD 
TLCSALYVGD 
TLCSALYVED 
AFCSAMYVGD 
AFS S AMYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
ALCSAMYVGD 
ALCSAMYLGD 
TLCSALYVGD 
AFCSAMYVGD 
AFCSAMYVGD 
TMCSALYVGD 
TLCSALYVGD 
TLCSALYVGD 
ALCSAMYVGD 
AFCSAMYVGD 
ALCSAMYVGD 
TLCSALYVGD 
TMCSALYVGD 
ALCSALYVGD 
WCSSMYIGD 
TMCSALYVGD 
TMCSALYVGD 
AFCSAMYIGD 
AFCSAMYVGD 
AACSALYIGD 



VCGALM I AAQ 

LCGSVFLISQ 

LCGGLFLVGQ 

LCGSVFLVSQ 

LCGSIFLVGQ 

LCGSVFLVSQ 

LCGSVFLVGQ 

LCGSVFLIGQ 

LCGSVFLVSQ 

VCGAVMILSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ. 

LCGSVFLVSQ 

LCGSLFLAGQ 

ACGALS LVGQ 

LCGSVFLVAQ 

LCGSVFLVSQ 

LCGSVFLVGQ 

AFGAVSLVGQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

LCGGVFLVGQ 

LCGALFLVGQ 

LCGSVFLVSQ 

LCGSVFLVSQ 

MCGPVFLVGQ 

LCGSVFLVGQ 

LCGGVMLAAQ 

FCGSVFLVSQ 

LCGSVFLVSQ 

FCGSVFLVSQ 

LCGGVMLAAQ 

MCGAVFLVGQ 

ACGAVFLVGQ 

LCGAVFLAGQ 

MCGAVFLVGQ 

MCGAVFLVGQ 

LCGSVFLVGQ 

LCGGIFLVGQ 

LCGGVFLVGQ 



IPVSPNVAVQ 
TPVTPTVAVR 
VQITPTLSAP 
HPVSPTLAVP 
TPVTPTVAVR 
TPVTPTVAVR 
LSVSPTLAVP 
LPATLTIAVP 
VPVAPTLAVA 

300 

WWSPQHHH 

LFTFSPRRHE 

MFSFRPRRHW 

LFTFSPRRHE 

IFTFSPRHHW 

LFTFSPRRHE 

LFTFSPRRHW 

LFTFSPRRHW 

LFTFSPRRHE 

AFMVSPQRHN 

LFTFSPRRYE 

LFTFS PRWHE 

LFTFSPRRYE 

LFTFSPRRHE 

LFTFSPRRYE 

LFTFSPRRYE 

LFTFSPRRHE 

LFAFS PRRYE 

LFTFS PRLHQ 

LFTFS PRRHA 

LFAFQPRRHW 

MFTYKPRQHT 

LFTFSPRRHE 

LFTFSPRRHA 

LFTFSPRHHW 

AFTFRPRQHK 

LFTFSPRRYE 

LFTFSPRRYE 

LFTFS PRRHV 

LFTFSPRRHE 

LFTFSPRRYE 

LFTFRPRIHQ 

GFSWRHRQHW 

LFTFSPRRYE 

LFTFSPRRYE 

AFTFRPRRHR 

LFTFSPRRHW 

MFIVSPQHHW 

LFTFSPRRYE 

LFTFSPRRHE 

LFTFSPRRYE 

MFIVSPQHHH 

AFTFRPRRHQ 

MFTYSPRRHN 

LATFSPRIHD 

AFTFRPRRHQ 

AFTFRPRRHQ 

LFTFRPKHHQ 

LFS FNPRRHW 

LFTFRPRQHT 



392 



WO 2005/012502 



PCT/US2004/009510 



BEBEl 
D89815 
ED43type_4 
HC_C2 
HCJ39 
HCU16326 
HCV_H__CMR 
HCVjJl 
HCVjT4 83 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1_R1 
HCVJK1JR2 
HCVJC1JR3 
HCV_K1_S1 
HCV_K1_S2 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12083 
HCV148 0 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM5 9 
NZLI 
SA13 
Th580 
Type_3 a_CB 
TypeV_D 
VN004 
VN235 
VN405 



301 

FVQECNCSIY 

TVQDCNCSIY 

TTQDCNCSIY 

TVQDCNCSIY 

TTQDCNCSIY 

TVQDCNCSIY 

TTQSCNCSIY 

TTQGCNCSIY 

TVQDCNCSIY 

FTQECNCSIY 

TVQDCNCSLY 

TVQDCNCSLY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSLY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSLY 

TVQDCNCSIY 

TLQDCNCS I Y 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TTQDCNCSIY 

TVQTCNCSIY 

TVQDCNCSIY 

TIQDCNCSIY 

TLQDCNCS I Y 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQTCNCSLY 

TTQGCNCSIY 

FVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

TVQDCNCSIY 

FVQECNCSIY 

TVQTCNCSLY 

WQDCNCSIY 

ITQDCNCSVY 

TVQTCNCSLY 

TVQTCNCSLY 

VTQDCNCSIY 

WQDCNCSIY 

TVQECNCSIY 



PGKITGHRMA 

PGHVS GHRMA 

TGHITGHRMA 

PGHITGHRMA 

PGHVTGHRMA 

PGRVS GHRMA 

PGHITGHRMA 

PGHITGHRMA 

PGHLS GHRMA 

QGHI TGHRMA 

PGHVS GHRMA 

PGHVSGHRMA 

PGHVS GHRMA 

PGHVSGHRMA 

PGHITGHRMA 

PGHVSGHRMA 

PGHVSGHRMA 

PGHITGHRMA 

PGHLTGHRMA 

PGHASGHRMA 

TGHVTGHKMA 

SGHI TGHRMA 

PGHITGHRMA 

PGHVSGHRMA 

PGHITGHRMA 

PGHVSGHRMA 

PGHVTGHRMA 

PGHVTGHRMA 

PGHVSGHRMA 

PGHLS GHRMA 

PGHVSGHRMA 

TGHVTGHRMA 

PGHLTGHRMA 

PGHVSGHRMA 

PGHVSGHRMA 

PGHLS GQRMA 

PGHITGHRMA 

PGTITGHRMA 

PGHVSGHRMA 

PGRVSGHRMA 

PGHVSGHRMA 

PGAITGHRMA 

PGHLS GHRMA 

SGHITGHRMA 

TGHVTGHRMA 

PGHLSGHRMA 

PGHLS GHRMA 

AGHITGHRMA 

VGHI TGHRMA 

TGH I TGHRMA 



WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMLSWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMKWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WNMMMNWSPT 

WDMMMNWSPA 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPA 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWSPA 

WDMMMNWSPT 

WDMMMNWSPT 

WDMIMNWSPT 

WDMMMNWSPT 

WDMIMNWSPT 

WDMMMNWSPT 

WDMMMNWSPA 

WDMMMNWSPT 

WDMMMNWSPT 

WDMMMNWFPA 

WDMMMNWSPA 

WDMMLNWSPT 

WDMMMNWSPT 

WDMMMNWSPT 



TTMLLAYLVR 
AALWSQLLR 
TTLVLAQVMR 
TALWSQLLR 
GALWAQLLR 
TALWSQLLR 
AALWAQLLR 
AALVMAQLLR 
TALWSQLLR 
LTM I LAYAAR 
TALWSQLLR 
AALWSQLLR 
TALWSQLLR 
TALWSQLLR 
TALWSQLLR 
TALWSQLLR 
TALWSQLLR 
TALWSQLLR 
AALWSQLLR 
TALWSQLLR 
TTLVLSS ILR 
TALLMAQLLR 
AALWSQLLR 
AALWSQLLR 
AALWAQLLR 
IGLVISHLMR 
TALWSQLLR 
TALWSQLLR 
TALWSQLLR 
AALWSQLLR 
TALWSQLLR 
ATFWSSALR 
MTLIVSQVLR 
TALWSQLLR 
TALWSQLLR 
VGMWAHILR 
TALVMAQLLR 
ATM I LA Y AMR 
TALWSQLLR 
TALWSQLLR 
TALWSQLLR 
ATM I LA YAMR 
VGMWAHVLR 
TALVMAQLLR 
TTLVLSS ILR 
LGMAVAHVLR 
VGMWSHVLR 
VSYWSSALR 
ATLVLSYVMR 
VTFITSSLLR 



350 

IPEWLDIIT 
I PQAVMDMVA 
IPTTLVDLLS 
I PQAVMDMVA 
IPQAIVDMIA 
IPQAWDMVT 
IPQAIMDMIA 
IPQAILDMIA 
I PQAWDMVA 
VPELVLEIIF 
IPQAWDMW 
I PQAWDMVA 
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I P QAWDMVA 
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351 4Qo 
GGHWGVMFGL AYFSMQGAWA KWVILLLTA GVEASTYTTG AWGRSTHLF 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGHTRVTG GVQGHVTSTL 
GGHWGVLVGV AYFSMQANWA KVILVLFLFA GVDAETHVSG AAVGRSTAGL 
GAHWGVLAGL AYYSMVGNWA KVLIVLLLFA GVDGNTYVTG GAAARGASGI 
GAHWGVLAGL AYYSMVGNWA KWWLLLFA GVDAETRVTG GAAGHTAFGF 
GSHWGILAGL AYYSMVGNWA KVLIAMLLFA GVDGTTHVTG GAQGRAASSL 
GAHWGVLAGI AYFSMVGNWA KVLWLLLFA GVDAETHVTG GSAGHTTAGL 
GAHWGVLAGI AYFSMVGNWA KVLWLLLFA GVDAETIVSG GQAARAMSGL 
GAHWGVLAGL AYYSMVGNWA KVLIVALLFA GVDGETYTSG GAASHTTSTL 
GGHWGWFGL AYFSMQGAWA KVIAILLLVA GVDATTYSSG QEAGRTVAGF 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGTTYVSV GHASQTTRRV 
GAHWGVLAGL AYYSMIGNWA KVLIVMLLFA GADGTTHVTG GVQAHGAYGL 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGNTYTTG AAQGRTVSRL 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGNTYVSG GAKSHTTOGL 
GAHWGVLAGL VYYSMVGNWA KVLIVMLLFV GVDGSTHVSG GTTAYNTRSF 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGDTYTTG GAQGHTT SRV 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGRTTVTG GAQGHTTORL 
GAHWGVLAGL VYYSMVGNWA KVLIVMLLFV GVDGSTRVSG GTTAYNTRGL 
GAHWGVLAGL AYYPMVGNWA KVLIVMLLFA GVDGTTVTMG GTVARTTYGF 
GAHWGVLAGL AYYSMAGNWA KVLIVMLLFA GVDGHTLTTG GHAAHLTSGF 
GGHWGILLAV AYFGMAGNWL KVLAVLFLFA GVEAQT . MIA HGVSQTTSGF 
GGHWGVLLAA AYF AS TANWA KVILVLFLFA GVDGRTHTVG GTVGQGLKSL 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGDTHTTG GVAGRDTLRF 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGTTTVTG GSOARTVYEL 
G AHWGVLAG I KYFSMVGNWA KVLWLLLFA GVDAETHVTG GNAGRTTAGL 
GAHWGVMAGL AYF SMQGNWA KWIVLIMFS GVDATTHTTG GSAAQATAGF 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGSTIVSG GTVARTTHSL 
GAHWGVLAGL AYYAMVGNWA KVLIVMLLFA GVDGDTYASG GAQGRS TLGF 
GAHWGVLAGL AYYSMAGNWA KVLIVMLLFA GVDGDTHVTG GAQAKTTNRL 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGQTYTTG GAVARTTTGF 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGHTHVTG GRVASSTQSL 
GGHWGIIGAL LYYSTAANWA KVI I VLLLFA GVDAST.YVA SSVSQATSGL 
GAHWGVMAGV AYYSMQGNWA KVFLVLCLFS GVDASTTITG GVAAS GAFTI 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGVTYTTG GSQARHTOSV 
GAHWGVLAGL AYYSMVGNWA KVLIVMLLFA GVDGVTYTTG GSQARHTQGV 
GAHWG 1 1 AGL AYYSMQGNWA KVAIIMVMFS GVDASTHVTA GO AARNAYG T 
G AHWGVLAG I AYFSMVGNWA KVLWLLLFA GVDAETHVTG GSAGHTVSGF 
GAHWGVMFGL AYFSMQGAWA KVWILLLAA GVDAQTHTVG GSTAHNARTL 
GAHWGVLAGL AYYSMVGNWA KVLWMLLFA GVDGGTHVTG GKVAYTTOGF 
GSHWGILAGL AYYSMVGNWA KVLIAMLLFA GVDGTTHVTG GAQGRAASSL 
GAHWGVLAGL AYYSMVGNWA KVLWMLLFA GVDGGTHVTG GKVAYTTOGF 
GAHWGVMFGL AYFSMQGAWA KWVILLLTA GVDAHTRS I A GSVAHATSGL 
GAHWG I L AGL AYYSMQGNWA KVAIIMVMFS GVDAHTYTTG GTAS RHTOAF 
GAHWGVLFAA AYYASAANWA KWLVLFLFA GVDANTRTVG GSAAQGARGL 
GGHWGVL I AI AYFGMSGNWL KVIAVLFLFA GVEATT . TVG RAAGRSAYLF 
GAHWGILAGL AYYSMQGNWA KVAIIMVMFS GVDAVTYTTG GSAAHATRGL 
GAHWG I LAGL AYYSMQGNWA KVAVIMVMFS GVDAETYITG GSAAHGVSTL 
GAHWGVLGAL LYFSMVANWA KVIAVLFLFA GADATT . YTG SAVSSTTGAF 
GGHWG I LAG I LYYSMVANWA KVLCILFLFA GVDATTRTTG AQAARATLGF 
EGHWGVIGAL LYYSMVANWA KVFAVLLLFA GVDATT . HIG S S AS ATTNRL 
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NDSLNTGFLA 
NDSFNTGFLA 
NESLDTGWLA 
NDSLNTGFVA 
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NESLNTGWLA 



450 

ALFYTSSFNS 
ALFYTHKFNA 
SLFYTHKFNS 
ALFYAHRFNS 
GLLYYHKFNS 
ALFYKYRFNA 
GLFYRHKFNS 
GLIYQHKFNS 
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ASLFSPGASQ 
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ASLFNVGPHQ 
TSIFTPGASQ 
VSLFAPGAQQ 
TSLFSSGAQQ 
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ASLFTFGAQQ 
TSLFSSGAQQ 
TGLFRPGASQ 
AGLFTPGPSQ 
ASLLTPGAKQ 
TSFFNPGPQR 
TGFFSLGPKQ 
TSLYTRGPSQ 
VGLLTPGAKQ 
TSFFTRGPSQ 
ASLFTQGASQ 
TSLFTPGASQ 
VSMFASGPSQ 
ASLFSAGSQE 
VSWLSQGPSQ 
VSLFSAGARQ 
TSLFSTGAKQ 
TSFFTQGPAQ 
AS FFTPGPAQ 
TSLFSVGAKQ 
VS LLAPGAKQ 
TGMFSLGARQ 
TSFFSRGPSQ 
TSLFSPGPVQ 
TSFFSRGPSQ 
AGLFTSGAKQ 
AGLFDIGPQQ 
ASLFTPGPQQ 
TSIFSSGPNQ 
TSLFSVGAQQ 
TSLFSSGPQQ 
VSLFSPGPTQ 
TGLFQTGAKQ 
TSFFSPGSKQ 
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SGCPERLAAC 
SGCPERMASC 
SGCSERLACC 
SGCPERMASC 
SGCPERMASC 
SGCPERLATC 
SGCPERLASC 
SGCPERLASC 
SGCPERMASC 
SGCPERLSSC 
SGCSERMASC 
SGCPERMASC 
SGCPQRLASC 
SGCPERIASC 
SGCPERMASC 
SGCPQRLASC 
SGCPERIASC 



RIQLVNTNGS 
NLYLINTNGS 
KIQLVNTNGS 
KIQLVNTNGS 
NIQLINTNGS 
KIQLVNTNGS 
KIQLVNTNGS 
NIQLINTNGS 
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NIQLINTNGS 
KLQLVNTNGS 
NLQLINTNGS 
KIQLINTNGS 
KLQLVNTNGS 
KLQLVKTNGS 
NLQLVNSNGS 
NIHLINTNGS 
NVQLIKTNGS 



WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 
WHINSTALNC 
WHINRTALNC 
WHINRTALNC 
WHINSTALNC 
WHINSTALNC 
WHINRTALNC 
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WHINRTALNC 
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WHINRTALNC 
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WHINRTALNC 
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WHINRTALNC 
WHLNSTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALSC 
WHINRTALNC 
WHINRTALNC 
WHINSTALNC 
WHINRTALNC 
WHINRTALNC 
WHINSTALNC 
WHINSTALNC 
WHINRTALNC 
WHINRTALNC 
WHINRTALNC 



NDSLHTGFLA 
NDSLQTGFLA 
NESINTGFFA 
NDTLQTGFLA 
NDSLQTGFIA 
NDSLNTGFLA 
NDTLQTGFLA 
NDSLQTGFIA 
NDSLNTGFLA 
NDTLQTGFLA 
NDSLNTGFLA 
NDSLQTGFLA 
NDSLQTGFLA 
NDSLQTGFIA 
NDS LNTGWLA 
NDSLQTGFLA 
NE S LNTGWLA 
NDSLNTGFIA 
NDSLQTGFLA 
NDSLNTGFLA 
NDSLQTGFLA 
NDSLNTGFLA 
NDSLQTGFIA 
NDSLQTGFIA 
NDSLNTGFIA 
NESLNTGFFA 
NESLNTGFFA 
NESINTGFIA 
ND S LNTGWLA 
NDSLHTGFLA 
NDSLNTGFLA 
NDSLNTGFVA 
NDSLNTGFLA 
NDSLNTGFIA 
NESINTGFIA 
NDSLQTGFVA 
IDSLQTGFLS 
NESINTGFIA 
NESINTGFIA 
NDSLQTGFIA 
NDSLNTGFMA 
NDSLHTGFIA 



RSIESFRIGW 
RSIDKFDQGW 
KSLDSYGQGW 
RSIDKFDQGW 
QPLTAFDQGW 
RPIDTFAQGW 
RRLTDFAQGW 
RRLTDFDQGW 
RPIDWFAQGW 
RGLDDFRIGW 
RPIDRFAQGW 
RPIDKFAQGW 
RSIDKFAQGW 
RSIDAFAQGW 
RPIDKFAQGW 
RSIDKFAQGW 
RSIDAFAQGW 



GSLEYEESVT 
GPITYAQPD. 
GPLGVANISG 
GPITYYQGD . 
GPITHEGNA. 
GPITYTEPH. 
GPISYANGS . 
GPISHANGS . 
GPITYTEPD. 
GTLEYETNVT 
GPITHAESR. 
GPITYAEPD . 
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GLFYYHKFNS 
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SLFYTHSFNS 
ALFYTHSFNA 
ALFYKYRFNA 
ALFYTHSFNA 
SLFYTYRFNS 
GLFYYHKFNS 
GLLYYHKFNS 
ALFYRSNFNS 
GLFYYHRFNS 
GLFYYHKFNS 
GLFARYKFNS 
ALFYLHKFNS 
GLLYAHRFNS 
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KP I TYFNQGW 
RSIDKFDQGW 
RPIDTFDQGW 
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RFIDEFDQGW 
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SPLDRFAQGW 
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GPLTDANISG 
GSLTDANVTG 
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RAEALLQVWI 
RVEAQLHVWV 
RGEAALQVWV 
RAEAHLQVWI 
IAEAHLQVWI 



PPLQARGGRD 

PPLNVRGGRD 

PS FNVRGPRD 

PPLNVRGGRD 

PPLNARGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLEVRGGRD 

PPLNVRGGRD 

PPLNIRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRG 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PDLRVRGGRD 

PPLQVKGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLNVRGGRD 

PPLLVRGSRD 

PPLNVRGGRD 

PPLNIRGGRD 



GVIWAMTMFY 

AIILLMCAVH 

SVIVLAVLVC 

AIILLTCAAH 

GWLLTCVLH 

AIILLTCWH 

AVILLMCWH 

AVILLMCWH 

AI I LLTCAVH 

GIIWVAVILH 

AI I LLTCAVH 

AIILLMCAVH 

AI I LLTCAVH 

AI ILLTCALH 

AI I LLTCAVH 

AIILLTCWH 

AI ILLTCALH 

AI ILLTCAVH 

AVILLTCAVY 

AI ILLTCAVH 

SIILLTCLLH 

AVIMLTCLFH 

AIILLTCWH 

AI ILLTCAVH 

AVILLTCWH 

SVILLASLLY 

AI I LLTCAAY 

AI ILLTCAIH 



900 

PGWFDITKW 

PELIFDITKL 

PDLVFDITKY 

PELIFDITKL 

PHLLFEITKI 

PELIFDITKY 

PTLVFDITKL 

PTLVFDITKL 

PELIFDITKL 

PRLVFEVTKW 

SELIFDITKI 

PELIFDITKL 

PELIFDITKI 

PELIFSITKI 

SELVFEITKI 

PELIFDITKI 

PELIFSITKI 

SELVFEITKI 

PELVFDITKL 

PELIFDITKL 

PQLGFEVTKI 

PALGFEITKI 

PELIFDITKL 

PGLIFDITRL 

PALVFDITKL 

PSLIFDITKL 

PELIFDITKI 

PESIFDITKL 
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HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a_CB 
TypeV_D 
VN004 
VN235 
VN405 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC_G9 
HCU1632 6 
HCV_H_CMR 
HCV_J1 
HCV_J4 83 
HCV_J8 
HCV_JK1 
HCV_JS 
HCV_K1_R1 
HCV_K1_R2 
HCV_K1_R3 
HCVJCIJSI 
HCV_K1JS2 
HCVJC1JS3 
HCV_L2 
HCV_N 
HCV12 083 
HCV14 80 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 



LIWWLQYFTT 
LIWWLQYLLT 
LIWWLQYFIT 
FIiWWTQYFIA 
LIWWNQYTIA 
LIWWLQYFIT 
LIWWLQYFIT 
LIWWNQYTIC 
CLWWLQYFLT 
FLWWLCYLLT 
LIWWLQYLIT 
FIWWLQYLIT 
LIWWLQYLIT 
SLWWLCYLLT 
LMWWNQYTIC 
VIWWLQYFIA 
AIWWNQYMLA 
LIWWNQYTIC 
LIWWNQYTIC 
ML WW I QYF I A 
FLWWNQ YF I A 
TLWWIQYYIA 

901 

LLAILGPGYL 

LLS ILGPLMV 

LLAILGPLHI 

LLAIFGPIMV 

MLAILGPLWI 

LLAI FGPLMV 

LLAIFGPLWI 

LLAVLGPLW I 

LLAILGPLMV 

LLAILGPAYL 

LLAI FGPLMV 

LLAILGPLMV 

VLAILGPLMV 

LVAILGPLMV 

LLAI FGPLMV 

MLAILGPLMV 

LVAILGPLMV 

LLAI FGPLMV 

LLAI FGPLMV 

LIAILGPLMV 

LLAVLAPLY I 

LFGILGPLYL 

LLAMLGPPMV 

LLAILGPLMV 

LLAIFGPLWI 

LIAVLGPLYL 

LLAILGPLMV 

LLATLGPLLV 

LIAILGPLMV 

LLAILGPLYM 

LLAILGPLMV 

LLALIGPLYL 

LLAILGPLYI 

LLAILGPLMV 

LLAILGPLMV 

LIAALGPLYL 

LLAVFGPLWI 



RAEADLHVWI 
RAEAHLHMWI 
RAEAHLQVWV 
HIEAKLHVWV 
RCEAALQIWV 
RAEAHLCVWV 
RAEAHLCVWV 
RCESALHVWV 
RVEAQLHVWI 
LAEAMVQEWA 
RAEAHLQVWI 
RTEAHLQVWV 
RAEAHLQVWI 
LGEAM I QEWV 
RCESALHVWV 
RVEAFIHVWV 
RAEAMIHDWV 
RCESALQVWV 
RCESALQVWV 
LTEAQLHQWV 
RSEALIQQWV 
RAEAMLYVWV 



PPLNARGGRD 
PPLNVRGGRD 
PPLNVRGGRD 
PFLRVRGGRD 
PPLLARGARD 
PPLNVRGGRD 
PPLNVRGGRD 
PPLLARGGRD 
PPLNVRGGRD 
PPMQVRGGRD 
PPLNVRGGRD 
PPLNVRGGRD 
PPLNVRGGRD 
PPMQARGGRD 
PPLLARGSRD 
PPLQVRGGRD 
PDLRVRGGRD 
PPLLARGSRD 
PPLLARGSRD 
PSLLVRGGRD 
PSLRVRGGRD 
PSLQVRGGRD 



AIILLMCAVH 
AIILLACAAH 
AI I LLTCAVH 
AI I LLTCVFH 
GIILLAGLFY 
AI I LLTCAAH 
AI I LLTCAAH 
GVILLTSLLY 
AVI LLMCAVH 
GIIWAVAIFC 
AI I LLTCAVH 
AI ILLTCWH 
AI I LLTCAVH 
G I I WAATMFC 
GVILLTSLLY 
AI IMLTCLFH 
AIILLTCLLH 
GVILLTSLLY 
GVILLTSLLY 
AVILLACLFH 
AVILLTCLLH 
AVILLTCLLH 



FRAAVMRTPY 
LQASLIRVPY 
LQASLLRIPY 
LQAGITKMPY 
LQASLLKVPY 
LQAGITRVPY 
LQASLLKVPY 
LQASLLKVPY 
LQAGITRVPY 
LKASLLRIPY 
LQAGLTRVPY 
LQAGITRVPY 
LQAG I AKVPD 
LQAGITKMPY 
LQAGLTKVPY 
LQAGIAKVPD 
LQAGITKMPY 
LQAGLIRVPY 
LQAGIIKMPY 
LQAGITRVPY 
LQYSLLKVPY 
LQHS LTKVPY 
LQAVITKVPY 
LQAGITRVPY 
LQASLLKVPY 
IQAALTSTPY 
LQAGLTRIPY 
LQAGITRVPY 
LQAGITRVPY 
LQAGITAVPY 
LQAGITRVPY 
LHAS LLRVP Y 
LQASLVRVPY 
LQAAITAMPY 
LQAAITAMPY 
IQATITATPY 
LQASLLKVPY 



FVRANALLRM 

FVRAQGLIRA 

FVRAQALVKI 

FVRAQGLIRA 

FVRAHGLIRL 

FVRAQGLIRA 

FVRVQGLLRI 

FVRVQGLLRI 

FVRAQGLIRA 

FVRAHALLRV 

FVRAQGLLRA 

FVRAQGLIRA 

FVRAQGVIRE 

FVRAQGLIRA 

FVRAQGLIRA 

FVRAQGVIRE 

FVRAQGLIRA 

FVRAQGL IRV 

FVRAQGLIRA 

FVRAQGLIRA 

FVRAHILLRA 

FLRARALLRL 

FVRAQGLIRA 

FVRAHGLIRA 

FVRVQGLLRI 

FVRAHVLIRI 

FVRAQGLIRA 

FVRAHGLIRA 

FVRAQGL IHA 

FVRAQGLIRA 

FVRAQGLIRA 

YVRAHALIRI 

FVRAHAWRL 

FVRAQGLIRA 

FVRAQGLIRA 

FVRAHVLVRL 

FVRVQGLLRF 



CALVKQLAGG 
CMLVRKAAGG 
CSLLRGVVYG 
CMLVRKVAGG 
CMLVRKTAGG 
CMLARKWGG 
CALARKIAGG 
CALARKMVGG 
CMLVRKVAGG 
CTLVKHLAGA 
CMLVRKVAGG 
CMLVRKVAGG 
C I L VRKVGGG 
CMLVRKVAGG 
CMLVRKVAGG 
CILVRKVGGG 
CMLVRKVAGG 
CMLVRKVAGG 
CMLVRKVAGG 
CMLVRKVAGG 
CLLVRRLAGG 
CLLAKHLVYG 
CMLVRKVAGG 
CMLVRKVAGG 
CALARKIAGG 
CMLVRSAMGG 
CMLVRKAAGG 
CMLLRKVAGG 
CMLVRKVAGG 
CMLVRKVAGG 
CMLVRKVAGG 
CALVQNVAGG 
CILVRNITGG 
CMLVRKVAGG 
CMLVRKVAGG 
CMLVRSVMGG 
CALARKMIGG 



PELIFDITKL 
PELIFDITKL 
PELIFDITKL 
PSLGFEVTKI 
PALVFD ITKL 
PELIFDITKL 
PELIFDITKL 
PSLIFDITKL 
PTLVFDITKL 
PGWFDITKW 
PELIFDITKL 
PELIFDITKY 
PELIFDITKL 
PGWFDITKW 
PSLIFDITKL 
PALGFEVTKI 
PHLGFEVTKI 
PSLIFDITKL 
PSLIFDITKL 
PQLGFEVTKI 
PSLGFDITKM 
PQLGFEVTKA 

950 

KYVQVAL ITL 
HYVQMAFVKL 
KYFQMWLKS 
HYVQMAFMKL 
QYVQMALLKL 
HYVQMVFMKL 
HYVQMAI I KL 
HYVQMAI I KL 
HYVQMAFMKL 
RYIQMLLITI 
HYVQMALMKL 
HYVQMAFVKL 
QYVQMAFMKL 
HYVQMAFMKL 
HYVQMAFVKL 
QYVQMAFMKL 
HYVQMAFMKL 
HYVQMAFVKL 
HYVQMAFMKL 
HYVQMAFMRL 
KYVQACLLRL 
KYVQAALLHL 
HYVQMALMKL 
HYVQMAFMKL 
HYVQMAI I KL 
KYVQMAVLTV 
HYVQMALMKL 
HYVQMAFMKL 
HYVQMAFMKL 
HYVQMAFMKL 
HYVQMAFMKL 
KYVQAAILRA 
KYVQMVLLAL 
HYVQMAFMKL 
HYVQMAFMKL 
KYFQMIILSL 
HYVQMVI I KL 
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HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3 a_CB 
TypeV_D 
VN004 
VN235 
VN405 



BEBE1 
D89815 
ED43type_4 
HC_C2 
HC_G9 
HCU16326 
HCVJH_CMR 
HCV_J1 
HCV_J483 
HCV_J8 
HCV_JK1 
" HCV_JS 
HCV_K1_R1 
HCV_K1_R2 
HCVJC1JR3 
HCV_K1_S1 
HCVJC1J32 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12083 
HCV1480 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPCJK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a_CB 



LLAVLGPAYL 
LIiAILGPLMV 
LLAIFGPLMV 
LLAILGPLMV 
LLAMLGPAYL 
LMAVLGPLYL 
LLGILGPLYL 
LLAILAPLYI 
LIAVLGPLYL 
LIAVLGPLYL 
LLALLGPLYIi 
LLALLGPLYL 
ILALLGPLYI 

951 

GKWTGTYIYD 
AALTGTYVYD 
RGLTGT Y I YD 
AALTGTYVYD 
GAFAGTYIYN 
AALAGTYVYD 
GALTGTYVYN 
GALTGTYVYN 
GALTGTYVYN 
GRWTGTYIYD 
AALTGTYVYD 
AALTGTYVYD 
AELKGTSVYD 
AALTGTYVYD 
AALTGTYVYN 
AELKGTSVYD 
AALTGTYVYD 
AALTGTYVYN 
AALTGTYVYD 
GALTGTYVYN 
GAWTGTFVYD 
GRLTGTYIYD 
AGLTS TYVYD 
AALTGTYVYD 
GALTGTCVYN 
GRWFNTYLYD 
AALTGTYVYD 
GALTGTYVYN 
GALTGTYIYN 
AALTGTYVYD 
AALTGTYVYD 
GSWTGTYIYD 
ARGFNTYLYD 
AALTGTYVYD 
AALTGTYVYD 
ADGSNTYLYD 
GALTGTYVYN 
GRWTGTYIYD 
AALTGTYVYD 
AALAGTYVYD 
AALTGTYIYD 
GRWTGTYIYD 
GRWFNTYLYD 
GRLTGTYIYD 
GAWTGT Y I YD 
GRWFNTYLYD 



LKGALTRVPY 
LQAGLTQMPY 
LQAGITRVPY 
LQAGITRVPY 
LRDALTRVPY 
IQATITTTPY 
LQYSLIKLPY 
LQHSLLKVPY 
I QAAVTTTP Y 
IQASITATPY 
LQYSLLKTPY 
LQVS LLRVPY 
LQYSLLKTPY 



FVRAHALLRM 
FVRAQGL IRM 
FVRAQGLIRA 
FVRAQGL IRA 
FVRAHALLRL 
FVRAHVLVRL 
FIRARALLRA 
FVRAHILLRA 
FVRAQVLVRL 
FVRAHVLVRL 
FVRAHILLRA 
YVRAHALLRV 
FVRAHILLRV 



CTMVRHLAGG 
CMLVRKAAGG 
CMLARKWGG 
CMLVWKAAGG 
CTMVRHLAGG 
CMLVRSVIGG 
CLLAKHLACG 
CMFFRKVAAG 
CMLVRSVMGG 
CMLVRSMMGG 
CMFFRGMARG 
CILVRRVAGG 
CMFLRGVAGG 



HLSPMSDWAA 
HLTPLQDWAH 
HLTPMSDWPP 
HLTPLRDWAH 
HLS PLQDWAH 
HLTPLRDWAH 
HLTPLRDWAH 
HLTPLRDWAH 
HLTPLRDWAH 
HLSPLSTWAA 
HLTPLRDWAH 
HLAPLQHWAH 
HLTPLRDWAH 
HLTPLQDWAH 
HLTPLQDWAH 
HLTPLQDWAH 
HLTPLQDWAH 
HLTPLQDWAH 
HLTPLRDWAH 
HLTPLRDWAH 
HLAPLSDWAS 
HLAPMKDWAA 
HLTPLQDWAH 
HLTPLRDCRH 
HLAPLRDWAH 
HLSPIQDWAA 
HLTPLQDWAH 
HLTPLQYWPR 
HLTPLRDWPR 
HLTPLRDWAH 
HLTPLRDWAH 
HLVPLRTWAS 
HLS PMTDWAA 
HLTPLQDWAH 
HLTPLQDWAH 
HLAPMQHWAA 
HLTPLRDWAH 
HLTPMSDWAA 
HLTPLQDWAH 
HLTPLRDWAH 
HLTPLRDWAH 
HLSPMSDWAA 
HLAPMQHWAA 
HLAPMKDWAA 
HLAPLSEWAS 
HLAPMQHWAA 



DGLRDLAVAV 

VGLRDLAVAV 

YGLRDLAVAL 

TGLRDLAVAV 

SGLRDLAVAT 

TGLRDLAVAV 

NGLRDLAVAV 

NGLRDLAVAV 

AGLRDLAVAV 

QGLRDLAIAV 

SGLRDLWAV 

TGLRDLAVAV 

TGLRDLAVAV 

TGLRDLAVAV 

TGLRDLAVAV 

TGLRDLAVAV 

AGLRDLAVAV 

TGLRDLAVAV 

TGLRDLAVAV 

AGLRDLAVAV 

DGLRDLAVAV 

SGLRELTVAT 

GGLRDLAVAV 

AGLRDLAVAV 

NGLRDLAVAV 

EGLKGLAVAT 

TGLRDLAVAV 

AGLRELAVAV 

AGLRDLAVAV 

AGLRDLAVAV 

AGLRDLAVAV 

DGLRDLAVAV 

EGLKDLAVAV 

AGLRDLAVAV 

AGLRDLAVAV 

AGLKDLAVAT 

NGLRDLAVAV 

NGLRDLAVAV 

AGLRDLAVAV 

TGLRDLAVAV 

AGLRDLAVAV 

SGLRDLAVAV 

AGLKDLAVAT 

SGLRDLAVAT 

DGLRDLAVAV 

AGLKDLAVAT 



EPIVFSPMER 

EPWFSAMET 

EPWFTPMEK 

EPWFSDMET 

EPVIFSRMEI 

EPWFSDMET 

EPWFSRMET 

EPWFSQMET 

EPWFSDMET 

EPWFSPMEK 

EPWFSDMET 

EPWFSDMET 

EPWFSDME I 

EPWFSDMET 

EPWFSDMET 

EPWFSDME I 

EPWFSDMET 

EPWFSDMET 

EPWFSDMET 

EPVTFSDMET 

EPVIFSPMEK 

EP I VFSAMET 

EPWFSDMET 

EPWFSAMET 

EPWFSRMET 

EPVIFSPMEI 

EPWFSDMET 

EPVI FSDMET 

EPWFSDMET 

EPWFSDMET 

EPWFSDMET 

EPWFSPMEK 

EPVIFSPMEV 

EPWFSDMET 

EPWFSDMET 

EPVIFSPMEI 

EPWFSQMET 

EPIIFSPMEK 

EPWFSDMET 

EPWFSDMET 

EPWFSDMET 

EPIIFSPMEK 

EPVIFSPMEI 

EPIIFSPMET 

EPVIFSPMEK 

EPVIFSPMEI 



RYVQMVLLAL 
HYVQMALMKL 
HYVQMVFMKL 
HYVQMAFVKL 
RYVQMALIiAL 
KYFQMIILSI 
RYVQAALLHL 
KYVQACLLRL 
KYFQMIILSI 
KYFQMIILSV 
RYAQAI LLRI 
KY I QAALLKL 
KYVQAALLRL 

1000 
KVIVWGAETT 
KVI TWGADTA 
KVIVWGADTA 
KI I TWGADTA 
KT I TWGADTA 
KVITWGADTA 
KL I TWGADTA 
KL I TWGADTA 
KVITWGADTA 
KVIVWGAETV 
KI ITWGADTA 
KI ITWGADTA 
KI I TWGGDTA 
KI ITWGADTA 
KI ITWGADTA 
KI ITWGGDTA 
KI ITWGADTA 
KI ITWGADTA 
KI ITWGADTA 
KI ITWGADTA 
KI ITWGADTA 
KVITWGADTA 
KI ITWGADTA 
KI ITWGADTA 
KL I TWGADTA 
KVITWGADTA 
KI ITWGADTA 
KI I TWGADTA 
KI ITWGADTA 
KI I TWGADTA 
KL I TWGADTA 
KVITWGADTA 
KVITWGADTT 
KI I TWGADTA 
KI ITWGADTA 
KVITWGADTA 
KL I TWGADTA 
KVIVWGAETA 
KI I TWGAETA 
KVITWGADTA 
KI ITWGADTA 
KVIVWGAETA 
KVITWGADTA 
KVITWGADTA 
KI ITWGADTA 
KVITWGADTA 
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TypeVJ) GRWFNTYLYD HLAPMQHWAA AGLKDLAVAT EPVIFSPMEI KVITWGADTA 

Zllt ^S*"™ HLAPLSTWAS DGLRDLAVAV EPVTFSPMEK KIITWGM 
VN405 GAWTGTYI YD HLTPLSDWAC DGLRDLAVAV EPWFSPMEK ^SSSS 



BEBE1 
D89815 
ED43type__4 
HC_C2 
HCJ39 
HCU16326 
HCV_H_CMR 
HCV_J1 
HCV_J483 
HCV_J8 
HCVjJKl 
HCV_JS 
HCV_K1_R1 
HCV_K1_R2 
HCV_K1_R3 
HCV_K1_S1 
HCV_K1_S2 
HCV_K1_S3 
HCV_L2 
HCV_N 
HCV12083 
HCV1480 
HCVPOLYP 
HD_1 
HPCCGAA 
HPCFG 
HPCGENANTI 
HPCGENOM 
HPCHUMR 
HPCJ 
HPCJCG 
HPCJK046 
HPC»JK049 
HPCJTA 
HPCJTB 
HPCK3A 
HPCPLYPRE 
HPCPOLP 
HPCPP 
HPCUNKCD 
MKC1A 
NDM59 
NZLI 
SA13 
Th580 
Type_3a_CB 
TypeV_D 
VN004 
VN235 
VN405 



1001 

ACGDIIHGLP 
ACGDIISGLP 
ACGD I IRGLP 
ACGDIILGLP 
ACGDIINGLP 
ACGDI ILACP 
ACGDIINGLP 
ACGDIINGLP 
ACGDIILGLP 
ACGD ILHGLP 
ACGDIILGLP 
ACGDIILGLP 
ACGD I IMGLP 
ACGD I ILGLA 
ACGD I I SGLA 
ACGD I IMGLP 
ACGDI ILGLA 
ACGDIISGLP 
ECGDI ILGYR 
• ACGDIILGLP 
ACGDILSGLP 
ACGNI LAVLP 
ACGDIISGLP 
ACGDIILGLP 
ACGDIINGLP 
ACGD I LCGL P 
ACGDIILGLP 
ACGDIILGLP 
ACGDIILGLP 
ACGDIILGLP 
ACGDIISGLP 
ACGD I LAGLP 
ACGDILCGLP 
ACGDIILGLP 
ACGDIILGLP 
ACGDILCGLP 
ACGDIINGLP 
ACGD I LHGLP 
ACGDI ISSLP 
ACGDI ILALP 
ACGDIILGLP 
ACGDVLHGLP 
ACGDILCGLP 
ACGD ILAGLP 
ACGDILRGLP 
ACGDILCGLP 
ACGDILCGLP 
ACGDI IAGLP 
ACGD I LAGLP 
ACGDIISGLP 



VSARLGQEVL 
VSARRGKEIL 
VSARLGNEIL 
VSARRGKEIL 
VSARRGREVL 
AS ARRGKE I L 
VSARRGQEIL 
VSARKGREIL 
VS ARRGKE IF 
VSARLGREVL 
VSARRGREIL 
VS ARRGRE I L 
VSARRGREIL 
VS ARRGKE I F 
VSARRGREIL 
VSARRGREIL 
VS ARRGKE I F 
VSARRGREIL 
SSARRGREIL 
VSARRGREIL 
VSARLGNLVL 
VS ARRGRE I F 
VSARRGREIL 
VSARRGREIL 
VSARRGQEIL 
VSARLGRELL 
VSARRGREIL 
VSARRGKEIL 
VSARRGKEIL 
VSARRGREIL 
VSARRGKEIL 
VS ARRGNL I L 
VSARLGKEVL 
VSARRGREIL 
VSARREREIL 
VSARLGREVL 
VSARRGREIL 
VSARLGREVL 
VSARRGRELL 
AS ARRGKE I L 
VSARRGKEIL 
VS ARLGRE I L 
VSARLGREVL 
VS ARRGHE I F 
VSARLGDLVL 
VSARLGREVL 
VSARLGREVL 
VAARRGNLLF 
VSARLGHLLF 
VS ARRGNL I F 



LGPADGYTSK 
LGPADSFEGQ 
LGPADTETSK 
LGPADSLEGQ 
LGPADALTDK 
LGPADSLEGQ 
LGPADGMVSK 
LGPADGMVSK 
LGPADSLEGQ 
LGPADGYTSK 
LGPADGFREQ 
LGPADSLEGQ 
LGPADSLEGQ 
LGPADSLEGS 
LGPADSFEGQ 
LGPADSLEGQ 
LGPADSLEGS 
LGPADSFEGQ 
LGPADSLEGQ 
LGPADSLVGQ 
LGPADDMQRG 
LGPADDIKTS 
LGPADSLKEQ 
LGPADSLEGQ 
LGPADGMVSK 
LGPADDYKKM 
LGPADSLEGR 
LGPADSLEGQ 
LGPADSLEGR 
LGPADSLEGQ 
LGPADSFGEQ 
LGPADDVKDK 
LGPADDYRSM 
LGPADSIEGQ 
LGPADSIEGQ 
LGPADDYREM 
LGPADGMVSK 
LGPADGYTSK 
LGPADS FDGQ 
LGPADSLEGQ 
LGPADSLEGQ 
LGPADGYTSK 
LGPADDYREM 
LGPADD IREA 
LGPADDMRHG 
LGPADDYREM 
LGPADDYREM 
LGPADDVKGK 
LGPADDMKSM 
LGPADD IRDG 



GWRLLAP I TA 
GWRLLAPITA 
GWRLLAP ITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAPITA 
GWRLLAP I TA 
GWRLLAPITA 
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Table 23. HIV Fusion Construct 



MGMQVQIQ^I*lXLLWVFaSRG*aVGKIWAQAAIIiaBP\^ 
KAKFVAAWTLKAAAKAFPVW'QWWAAKLTFLC^^ 
AAAAIFQSSMTiaCTTU'CAflDAKlWYNFQ^ 
WMNFKKAAAYPlASIJ^LJ^TFOWCFKLNRu^ 

VPLQWPlKAJfCTWPtEX. .„ ,. ' ... - 

ATGGOAATGCAOGTGCAGATC&AaAaajrGTTC^ 
GOAAAG<nXK}lX3K3<3CAAACTCVL^ 
QOTGAATOCOaCiraCCCTAAAarciACK^ 
A0CTX3CCAAAaCTAAOTTTOTG3<XX3 

c^aaaxx^GtnxKxnrcnraoaocK^^ 

1X3C^QTXKnV30CAOAG<3CCAWI^^ 
GCH3<KKX5CKn , GCAaCCATOTTX^^ 

COACCKHVAAGAACATOCCTTATAATC^^ , 
<XK}ACCTATr<3CrAACXTrOA<X^ 

QATQOCXXJTQlTrATTCACAATTTCAAAAAOaOCGCrrQCAT^ 
CXnXOTCAACCTOACATTCGGCTGGlGCOT 

ATCAATOOTAAAATXXAOAACrrCCOajTCrACrATAaaAAQGCTO 

GGCGGACAACnTGAAGAAAGTGCCTCTCX2AGCTGCCXXX7rc^ 

OCCTATCCCAGTCTGA 



r 
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Table 24. HBV GCR-3697 Fusion Construct 
GCR-3697 I Polynucleotide 



SEQID 
NO: 



GCR-3697 

SEQID 
NO: 



1 Start 
■f* * » 



^^^^^ 

GCCTCCTGGTGCCCTTCGT^^ 

CCGCAG 
CC 



:cct 



3ACTGGTGCCCTTCGTGCAGTGGTT 



2232 

Polypeptide 



HPHHHriHr 



4. 
744 
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Table 25. HBV AOSIb2 Fusion Construct 



PCT/US2004/009510 



HBV 
AOSIb2 


Polynucleotide 


SEQID 


1 Start 
t 


NO._ 


(AT^OAAT(K^GGTGCAGATCCAGAG<XTGTTTCTGCTCCTCCTGTGGGTQCCCOGGTCCAGAGGACACAr 
Gj^AGCGATTTCTTTCCrAGCGTGAACTTCCTGCTGTCCCTGGGAATCCACCTGTATATGG 

C ^ A £ A £ AC ^ CG J^^ 

CGGAGG ^ TGmAAGGTGOGAAA< ^ACCGGCCTGTATAACCTGCCCA^ 
GACCCTGTGGAAGGCCGGAATCCTGTACAAGAATGTGTCCATCCCTTGGACCCACAAGGGAGC^ 

4^stop 

657 


HBV 
AOSIb2 


Polypeptide 


SEQID 
NO:_ 


l 
t 

MGMQVQIQSUFLLLLWWGSRGHTLWKAG^ 
GNFTGLYNIPSDFFPSVTCTXWKAGILYK3STVSIP 

_ 219 
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Table 26. HCV Fusion Construct 



AnxlADARVWAmJHi^OYOAOV**^ 



444 



WO 2005/012502 

Table 27. Plasmodium falciparum Fusion Construct 
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MOMQVQIQSIJFLUXWWQSROF^ 

.EtKAALLACUQIAYKKBH^VBALF^ 

IMFl^AAAYFILVNUJKAAAn^VSSnJ^ 

FlKYYIPHQSSUCAAaiXOl^STVGAVLLQOVOLVD^ 

VNIlJKAFttFFDI^LVKAIJPFirFhlK^ 

n^8VITtANAVXAaiXQN\WQDBBNiaiYKAAA^ 

HVL8HNSYEKNAAAKYLVIYFLI 

GCX^XX^eCATaaaAATGCAGOTGCAGArcCA^^ ' 

GO ATCGAGAGG ATTTATG AAAO CTGTCTGTQTAQAG GTG AATGTAACATGCGGTAACG QAAT 

TCAGGTOAGAAA<^GACTCATCATGGTACTCAGCTIT<^^ 

TOACOGA^C^TaAAATCAAAaCOaCA^ 

OTTTCX3rtTTCX3TrOAAGCACTATlTAAGa<^ 

AACK^QCrCAQACTAATTTCAAAAO<XnrGOT 

TACAAAGCCXXXX2K3CGTG1XXGAQAATAT^^ 

erra\ATCTACTX^TAAAOqC^QOCX3^ 

CATAT<KXioo<x}Aacc»ocrcxjmx^oacracAGc^ 

TGAAAQCAGCIGTGTITrroATATTCTTTO^ 
AOTCTTAAAGCAGOC^aCTA(^ GOO AACXJTCICT 

<nro<3oc7iwro TTOAAC xnxxKxnq^ 

TCTCTGTTOCACATITCTAAAGCCQCATTCTATIT^ 
OATCTTCTTCOATCTATIXXaXXITAAAAGOOCTAT^ 

GOCAAaCAAOAAAATTOGTAClXJACTCAAGTTTQTAa AAG CTCIXnTWAGaAATACAACGCC 

GCraCTAAATTCGITtrcA.GCTraGACCCTGAAAaCAGCT 

TtOCTAATGKXXjrTATTAGGJAGG 

GCAT^AC^VAAGCCGCAGCACTGTACATrTCA 

CCTTCTGATAlTIGATAATGCAaCACTG.CC^ 

CnTGAGCCACAACTCCrACaAGAAGAACGCCGCOaTO^ 

TTGA - • v • 
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Table 28. Mycobacterium tuberculosis Fusion Construct 



2&1 

MQVQIQSl^LLLVATOSRORMSRVTTFJVK^ 

<ft2*ttAVYLVQAAAMALIJU^VKIUv4F^ 

LKAAAKAAARmiOTAAAGFVVALIPLW 

ATCK^CKrTG<^GATCX:AaAQCX7imTrCT<5 

ATOAOCAG AOTOAC^^TrCACTOTG^aOCCCra GTCK7TOCTGAT<KnXXXXXnXX>Ta AAC 
CTX3ATOAlXX30CAOCaCTGK3AOCCXJWP(m?AAAac^^ 
CK^QCKKnXJATOACAO<XXrroTA<X^X3aTO3aOTC«^ 
GTOAAOCOCATXmTaCroCAAATCTaaOAGTC^ 

GGAACKKnrcoaxirrca^ 

ACTCTOAAaGCAaCXXKXrAAOKk?^ 

ocrraaroaccxTraATimxT^^ 
<^K7rac^oc^ATa<KrrcrccixK»acTO 
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Table 29. Hepatitis B Virus Core Protein (SEQ ID NO: J 

MQLFHLCLIISCSCPTVQASKLCLGWLWGMDIDPYKEFGATVELLSFLPSDFFPSVRD 
LLDTASALYREALESPEHCSPHHTALRQAILCWGELMTLATWVGVNLEDPASRDLW 
SYVNTNMGLKFRQLLWFHISCLTFGRETVIEYLVSFGVWIRTPPAYRPPNAPILSTLPE 
TTWRRRGRSPRRRTPSPRRRRSQSPRRRRSQSRESQC 
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